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METHODS AND COMPOSITIONS FOR MODULATION OF VESICULAR RELEASE 

5 This work was supported in part by NIMH Grant 5R37MH38710 and NIH Grant 

2P50MH48108. Accordingly, the United States Government has certain rights in this invention. 

Field of the Invention 

The present invention relates to methods of identifying compounds capable of modulating 
10 vesicular release. In particular, the invention relates to (i) methods of identifying such compounds 
employing the secretion associated 17S (SA-17S) complex, and (ii) the proteins comprising the SA- 
17S complex. 
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Back ground of the Invention 

Signal transmission between nerve cells typically involves the release of neurotransmitter 
from a presynaptic cell onto a postsynaptic cell. The neurotransmitter in the presynaptic cell is 
contained in synaptic vesicles positioned above the release sites at the presynaptic membrane (active 
5 zones). In response to a release signal (typically a local influx of calcium due to a depolarization of 
the presynaptic terminal), the vesicles undergo a series of mobilization steps culminating in the 
fusion of the vesicles with the presynaptic terminal membrane, and a dumping of vesicle contents 
into the synaptic cleft. 

The neurotransmitter molecules diffuse across the synaptic cleft and bind to corresponding 
10 receptors in the postsynaptic membrane to communicate the appropriate signal (typically a 

depolarization or hyperpolarization of the postsynaptic membrane) to the postsynaptic cell. Much of 
the neurotransmitter in the synapse is subsequently re-absorbed by the presynaptic cell through 
specific transmitter uptake mechanisms. 

A number of drugs affecting signalling in the central nervous system (CNS) and the 
15 peripheral nervous system (PNS) have been developed. Some of these, such as phenoxybenzamine, 
block specific post-synaptic receptors; others, such as clonidine and diethylamide, stimulate such 
receptors; still others (e.g., desipramine, imipramine) act on reuptake mechanisms, and some act on 
neurotransmitter synthesis (e.g., a-Methyltyrosine, p-Chlorophenylalanine) or degradation (e.g., 
monoamine oxidase inhibitors, iproniazid, pargyline). 
20 The present invention provides a tool for the screening and identification of drugs capable of 

affecting secretory processes, such as neurotransmitter release at the active zones of presynaptic 
membranes. 



Summary of the Invention 

25 In one aspect, the present invention includes a method of identifying a compound capable of 

modulating secretion of secretory vesicles. The method includes contacting an SA-17S complex with 
an SC complex, in the presence and absence of a test compound, measuring the effect of the test 
compound on the extent of binding between the SA-17S and the SC complexes, and identifying the 
compound as effective if its measured effect on the extent of binding is above a threshold level. 

30 In one embodiment, the SA-17S complex binds to the 7S SC complex. In another 

embodiment, the SA-17S complex binds to the 20S SC complex. 

The test compound may be effective to enhance (potentiate) or inhibit binding between the 
SA-17S and SC complexes. Compounds tested may include small molecules in a small molecule 
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combinatorial library, peptides (e.g., peptides in a peptide combinatorial library), and the like. A 
compound which enhances binding between the SA-17S and SC complexes may be used to treat an 
affective disorder (such as depression, manic-depressive disorders and anxiety disorders) or a 
neurodegenerative disease (such as Parkinson's disease or Huntington's disease). A compound 
5 which inhibits binding between the SA-17S and SC complexes may be used to treat a disorder of 
thought, such as schizophrenia. 

In another aspect, the invention includes a substantially purified SA-17S p71 polypeptide, 
having a molecular weight of about 71 kD and having the sequence of SEQ ID NO:54. 

In another aspect, the invention includes a substantially purified SA-17S p79 polypeptide, 
10 having a molecular weight of about 79 kD and having the sequence of SEQ ID NO:56. 

In another aspect, the invention includes a substantially purified SA-17S p84 polypeptide, 
having a molecular weight of about 84 kD and having the sequence of SEQ ID NO:58. 

In another aspect, the invention includes a substantially purified SA-17S p96 polypeptide, 
having a molecular weight of about 96 kD and having the sequence of SEQ ID NO:60. 
15 In another aspect, the invention includes a substantially purified SA-17S pl02 polypeptide, 

having a molecular weight of about 102 kD and having the sequence of SEQ ID NO:62. 

In another aspect, th^ invention includes a substantially purified SA-17S pl06 polypeptide, 
having a molecular weight of about 106 kD and characterized by at least one sequence selected from 
the group consisting of SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID 
20 NO:49, SEQ ID NO:50, SEQ ID NO:51, and SEQ ID NO:52. 

In another aspect, the invention includes a substantially purified polypeptide having (i) at 
least 60% sequence identity with a selected one of the SA-17S polypeptides p71, p79, p84, p96, 
pl02, and pl06, and (ii) the property of the selected polypeptide to form an SC-binding SA-17S 
complex when combined stoichiometrically with the other polypeptide components of the SA-17S 
25 complex. 

In another aspect, the invention includes a substantially purified polynucleotide having the 
sequence SEQ ID NO:53 encoding the SA-17S p71 polypeptide. 

In another aspect, the invention includes a substantially purified polynucleotide having the 
sequence SEQ ID NO:55 encoding the SA-17S p79 polypeptide. 
30 In another aspect, the invention includes a substantially purified polynucleotide having the 

sequence SEQ ID NO:57 encoding the SA-17S p84 polypeptide. 

In another aspect, the invention includes a substantially purified polynucleotide having the 
sequence SEQ ID NO:59 encoding the SA-17S p96 polypeptide. 
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In another aspect, the invention includes a substantially purified polynucleotide having the 
sequence SEQ ID NO:61 encoding the SA-17S pl02 polypeptide. 

In another aspect, the invention includes a substantially purified polynucleotide (i) having at 
least 60% sequence identity with a selected one of the polynucleotides encoding p71, p79, p84, p96, 
5 and pl02, and (ii) encoding a polypeptide having the property of the polypeptide encoded by the 
selected polynucleotide to form an SC-binding SA-17S complex when combined stoichiometrically 
with other polypeptide components of the SA-17S complex. 

In another aspect, the invention includes substantially purified antibodies, such as polyclonal 
or monoclonal antibodies, which are specifically-immunoreactive with one of the proteins or 
10 polypeptides described above. 

These and other objects and features of the invention will become more fully apparent when 
the following detailed description is read in conjunction with the accompanying drawings. 

Brief Description of the Figures 
15 Fig. 1 A shows a Coomassie blue-stained protein profile of glycerol gradient fractions from 

rat brain supernatant separated on a linear glycerol gradient, and analyzed by SDS-PAGE followed 
by Western blotting, with positions of rsec8, rsec6 and nsecl on the gel indicated on the right as 8, 
6, and 1 respectively. Fig. IB shows immunodetection of rsec6, rsec8 and nsecl in gradient 
fractions using immunoaffmity-purified antibodies, with size markers indicated by arrows. Fig. 1C 
20 shows co-immunoprecipitation of rsec6 and rsec8. 

Figs. 2A, 2B and 2C show the purification of rat brain SA-17S complex. Fig. 2A shows 
Coomassie blue-stained SDS polyacrylamide gel analysis of pooled column fractions. Fig. 2B shows 
a Western blot analysis of the column fractions shown in Fig. 2A. Fig. 2C shows a Coomassie 
blue-stained SDS polyacrylamide gel analysis of purified SA-17S complex. 
25 Figs. 3A, 3B, 3C and 3D show mass spectrometric analyzes of SA-17S complex peptides 

pl02, p96, p71 and p84, respectively, with the mass of each peptide predicted by mass spectrometry 
shown above the peptide peak. The amino acid sequence of each peptide determined by peptide 
sequencing and its mass (protonated form of peptide) calculated from the determined peptide 
sequence is shown beside corresponding peptide peak. 
30 Figs. 4A-4H show peptide sequences from the following SA-17S complex peptides, 

respectively: rsec8, pl06, pl02, p96, rsec6, p84, p79, and p71. 

Fig. 5A shows a Western blot analysis of brain regional expression of the SA-17S complex. 
Fig. 5B shows the presence of the SA-17S complex in cultured cell lines, indicated. 
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Fig. 6A shows a Western blot analysis of subcellular fractionation of the SA-17S complex. 
Fig. 6B shows association of the SA-17S complex with brain membranes as analyzed by Western 
blot Fig. 6C shows the nature of rsec8 association with brain membranes. Fig. 6D shows 
immunoprecipitation of synaptic vesicles from rat brain supernatant with anti-SV2 antibody or 

5 control mouse immunoglobulin coupled to Dynabeads M-500. 

Figs. 7A-D shows cultures of primary hippocampal neurons fixed and immunostained with 
antibodies against rsec8 (Figs. 7A and 7B) and synaptotagmin (Fig 7 C). Fig. 7D shows a larger 
magnification of rsec8 staining overlaid with that of synaptotagmin at two synaptic terminals 
indicated by arrows. Bars = 7^m. 

10 Fig. 8A is a Western blot analysis using seven monoclonal antibodies generated against 

rsec8 showing that syntaxin associates with the SA-17S complex. Fig. 8B shows a 
co-immunoprecipitation of syntaxin with rsec8. 

Rrief Descri ption of the Sequences 
15 SEQ ID NO:l is the nucleotide sequence of rsec6. 

SEQ ID NO:2 is the amino acid sequence of rsec6. 
SEQ ID NO: 3 is the nucleotide sequence of rsec8. 
SEQ ID NO:4 is the amino acid sequence of rsec8. 
SEQ ID NO:5 is the amino acid sequence of peptide fragment rsec6.1. 
20 SEQ ID NO:6 is the amino acid sequence of peptide fragment rsec6.2. 

SEQ ID NO:7 is the amino acid sequence of peptide fragment rsec6.3. 
SEQ ID NO: 8 is the amino acid sequence of peptide fragment rsec8.1. 
SEQ ID NO:9 is the amino acid sequence of peptide fragment rsec8.2. 
SEQ ID NO: 10 is the amino acid sequence of peptide fragment rsec8.3. 
25 SEQ ID NO:l 1 is the amino acid sequence of peptide fragment p71.1. 

SEQ ID NO: 12 is the amino acid sequence of peptide fragment p71.2. 
SEQ ID NO: 13 is the amino acid sequence of peptide fragment p7L3. 
SEQ ID NO: 14 is the amino acid sequence of peptide fragment p79.1. 
SEQ ID NO: 15 is the amino acid sequence of peptide fragment p79.2. 
30 SEQ ID NO: 16 is the amino acid sequence of peptide fragment p79.3. 

SEQ ID NO: 17 is the amino acid sequence of peptide fragment p79.4. 
SEQ ID NO: 18 is the amino acid sequence of peptide fragment p79.5. 
SEQ ID NO: 19 is the amino acid sequence of peptide fragment p79.6. 
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SEQ ID NO:20 is the amino acid sequence of peptide fragment p79.7. 
SEQ ID NO:21 is the amino acid sequence of peptide fragment p79.8. 
SEQ ID NO:22 is the amino acid sequence of peptide fragment p79.9. 
SEQ ID NO:23 is the amino acid sequence of peptide fragment p79.10. 
5 SEQ ID NO:24 is the amino acid sequence of peptide fragment p84.1. 

SEQ ID NO:25 is the amino acid sequence of peptide fragment p84.2. 
SEQ ID NO:26 is the amino acid sequence of peptide fragment p84.3. 
SEQ ID NO:27 is the amino acid sequence of peptide fragment p84.4. 
SEQ ID NO:28 is the amino acid sequence of peptide fragment p84.5. 
10 SEQ ID NO:29 is the amino acid sequence of peptide fragment p84.6. 

SEQ ID NO:30 is the amino acid sequence of peptide fragment p84.7. 
SEQ ID NO:31 is the amino acid sequence of peptide fragment p84.8. 
SEQ ID NO:32 is the amino acid sequence of peptide fragment p84.9. 
SEQ ID NO:33 is the amino acid sequence of peptide fragment p96.1. 
15 SEQ ID NO:34 is the amino acid sequence of peptide fragment p96.2. 

SEQ ID NO:35 is the amino acid sequence of peptide fragment p96.3. 
SEQ ID NO:36 is the amino acid sequence of peptide fragment p96.4. 
SEQ ID NO:37 is the amino acid sequence of peptide fragment p96.5. 
SEQ ID NO:38 is the amino acid sequence of peptide fragment pl02.1. 
20 SEQ ID NO:39 is the amino acid sequence of peptide fragment pl02.2. 

SEQ ID NO:40 is the amino acid sequence of peptide fragment pl02.3. 
SEQ ID NO:41 is the amino acid sequence of peptide fragment pl02.4. 
SEQ ID NO:42 is the amino acid sequence of peptide fragment pl02.5. 
SEQ ID NO:43 is the amino acid sequence of peptide fragment pl02.6. 
25 SEQ ID NO:44 is the amino acid sequence of peptide fragment pl02.7. 

SEQ ID NO:45 is the amino acid sequence of peptide fragment pl06.1. 
SEQ ID NO:46 is the amino acid sequence of peptide fragment pl06.2. 
SEQ ID NO:47 is the amino acid sequence of peptide fragment pl06.3. 
SEQ ID NO:48 is the amino acid sequence of peptide fragment pl06.4. 
30 SEQ ID NO:49 is the amino acid sequence of peptide fragment pl06.5. 

SEQ ID NO:50 is the amino acid sequence of peptide fragment p!06.6. 
SEQ ID NO:51 is the amino acid sequence of peptide fragment pl06.7. 
SEQ ID NO:52 is the amino acid sequence of peptide fragment pl06.8. 
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SEQ ID NO:53 is the nucleotide sequence encoding p71 (rseclO). 

SEQ ID NO:54 is the amino acid sequence of p71 (rseclO). 

SEQ ID NO:55 is the nucleotide sequence encoding p79 (rexo70). 

SEQ ID NO:56 is the amino acid sequence of p79 (rexo70). 
5 SEQ ID NO:57 is the nucleotide sequence encoding p84 (rexo84). 

SEQ ID NO:58 is the amino acid sequence of p84 (rexo84). 

SEQ ID NO:59 is the nucleotide sequence encoding p96 (rsecl5). 

SEQ ID NO:60 is the amino acid sequence of p96 (rsecl5). 

SEQ ID NO:61 is the nucleotide sequence encoding pl02 (rsec5). 
10 SEQ ID NO:62 is the amino acid sequence of pl02 (rsecS). 



Detailed Description of the Invention 

I. Definitions 

"Substantially purified" refers to the at least partial purification of a selected polynucleotide, 
15 polypeptide, or related compound away from unrelated or contaminating components (e.g., serum 
cells, other proteins). 

Whin a first polynucleotide fragment or polypeptide fragment is said to "correspond to" a 
second polynucleotide fragment or polypeptide fragment, respectively, it means that the fragments or 
regions are essentially co-extensive with one another when the sequences representing the fragments 
20 are aligned using a sequence alignment program, such as "MACVECTOR" (IBI, New Haven, CT). 
"Corresponding" polynucleotide or polypeptide fragments typically contain a similar, if not identical, 
number of residues. It will be understood, however, that corresponding fragments may contain 
insertions or deletions of residues with respect to one another, as well as some differences in their 
sequences. 

25 The term "significant", when used with reference to "significantly different", "significantly 

inhibits" or "significantly stimulates", refers to a difference in a quantifiable parameter between the 
two groups being compared that is statistically-significant using standard statistical tests. For 
example, the degree of binding in a protein binding assay may be quantified using standard methods, 
and the degree of binding under different conditions can be compared for statistically-significant 

30 differences. 

An antibody or antibody composition (e.g., polyclonal antibodies) is "specifically 
immunoreactive" with a selected protein when the antibody or antibody composition is not reactive 
with antigens typically present in normal sera, not exposed to the selected protein. 
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A "syntaxin-containing complex", or "SC complex", refers to a complex formed of one or 
more proteins, where one of the proteins is syntaxin (Bennett, et ai, 1993; Scheller, 1995). 
Exemplary SC complexes include the 7S complex (comprising VAMP, synaptotagmin, syntaxin and 
SNAP-25) and the 20S complex (comprising VAMP, syntaxin, SNAP-25, a-SNAP and NSF). 

A polypeptide is "characterized by" a selected sequence when the polypeptide contains a 
sequence that is identical or substantially identical to the selected sequence. 

"SNAP-25" refers to synaptosomal-associated protein of 25 kDa (Pevsner, et al, 1994a; 
Scheller, 1995). 

"a-SNAP" refers to soluble NSF attachment protein (Pevsner, et at,, 1994a; Scheller, 

1995). 

"NSF" refers to N-ethylmaleimide-sensitive factor (Pevsner, et al, 1994a; Scheller, 1995). 
"VAMP" refers to vesicle-associated membrane protein (Pevsner, etal, 1994a; Scheller, 

1995). 

II. Analysis. Isolation and Purification of the SA-17S Complex 

Experiments performed in support of the present invention and described in detail below 
have resulted in the identification of proteins that are involved in both regulated and constitutive 
exocytosis. Specifically, it has been discovered that in mammalian brain, the proteins rsec6 and 
rsec8, together with p71, p79, p84, p96, pl02 and pl06 proteins, are present in a high molecular 
weight complex, termed secretion-associated 17S (SA-17S) complex. 

Analysis of the rat brain SA-17S complex revealed that it contains both rsec6 and rsec8. As 
described in Example 1, soluble brain proteins were fractionated on a linear glycerol gradient and 
the gradient fractions were analyzed by both sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE) and Western blotting. A Coomassie blue-stained protein profile of the 
fractions is shown in Figure 1A. The migration of rsec6 and rsec8 in the gradient was monitored by 
Western blot analysis using affinity-purified anti-rsec6 and anti-rsec8 antibodies. The results are 
shown in Figure IB. Both rsec6 and rsec8 migrated at the 17S position. To determine if the 
co-fractionating rsec6 and rsec8 were in the same complex, rsec6 and rsec8 were 
immunoprecipitated from polyethylene glycol-fractionated rat brain supernatant with affinity-purified 
antibodies against rsec8 or with control rabbit immunoglobulin. The immunoprecipitated sample 
was analyzed for the presence of rsec6, rsec8 and nsecl by Western blotting. The results, shown in 
Figure 1C, demonstrate that rsec6, but not nsecl, co-immunoprecipitated with rsec8 from soluble 
brain proteins using affinity-purified rabbit polyclonal anti-rsec8 antibodies. Taken together, these 
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data demonstrate that both rsec6 and rsec8 are components of the same high molecular weight 
complex. 

Experiments detailed in Example 2 describe the purification of the SA-17S complex, and 
show that the complex is formed of 8 separate proteins or polypeptides. The SA-17S complex was 
5 purified from frozen rat brain by four sequential column chromatographic steps. The enrichment of 
the SA-17S complex following each chromatographic step was monitored by Western blot analysis 
using mouse polyclonal antibodies against rsec6 and rsec8, as shown in Figure 2B. After 
completing the steps detailed in Example 2, eight polypeptides of molecular weights 71, 79, 84, 86, 
96, 102, 106, and 110 kDa were apparent in a Coomassie blue stained SDS-PAGE gel. These 
10 proteins were termed SA-17S p71, SA-17S P 79, SA-17S P 84, SA-17S p86, SA-17S P 96, SA-17S 
pl02, SA-17S pl06, and SA-17S pi 10, respectively. Subsequent peptide sequence analyses, 
described below, determined that SA-17S pi 10 was rsec8 and SA-17S p86 was rsec6. SDS-PAGE 
analysis of pooled peak rsec6 and rsec8 immunoreactivity fractions indicated that these eight proteins 
constituted at least 95% of the total proteins in this fraction (Figure 2A, lane TMAE2; Figure 2C). 
15 Fractionation of the purified complex on a glycerol gradient demonstrated that all eight 

components co-migrate at the 17S position. In addition, all eight proteins co-immunoprecipitated 
with rsec8 monoclonal antibodies. Further, all eight proteins were approximately equally stained by 
Coomassie blue, suggesting a stoichiometry of one copy of each protein within the complex (Figure 
2C). This 1:1 stoichiometry is further supported by the apparent molecular weight of the native 

20 complex (-600-700 kD). 

The SA-17S is also unusually stable, with monomeric forms of the proteins forming the 
complex never observed under non-denaturing conditions. Unlike the 20S particle composed of 
syntaxin, VAMP, SNAP-25, a-SNAP and NSF, the SA-17S complex does not dissociate in response 
to ATP hydrolysis. In fact, this complex does not dissociate in the presence of ATP, GTP, ATP7S, 

25 GTP7S, EGTA and/or Ca 2+ . The mammalian SA-17S complex is found in soluble and 
membrane-bound states. 

III. Polypeptides Contained in the SA-17S Complex 

To further characterize individual components of the mammalian SA-17S complex, the 
30 purified complex was fractionated on an SDS polyacrylamide gel and each protein band was cut out 
and individually subjected to in-gel proteolysis as described in Example 3, to generate peptides 
which could be further characterized by high performance liquid chromatography (HPLC), mass 
spectrometry for peptide purity and mass, and peptide sequencing. 
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Figures 3 A, 3B, 3C and 3D show the mass spectrometric analyzes of HPLC peptide peaks 
from four proteins, SA-17S pl02, SA-17S p96, SA-17S P 84 and SA-17S p71, respectively, with the 
mass of each peptide predicted by mass spectrometry is shown above the peptide peak. The amino 
acid sequence of each peptide determined by peptide sequencing and its mass (protonated form of 
5 peptide) calculated from the determined peptide sequence is shown beside corresponding peptide 
peak, along with its SEQ ID NO. In Panels A to C the calculated and observed masses matched 
with less than 0.1 % difference. Peptide sequences obtained from the eight members of the SA-17S 
complex are shown in Figures 4A-H, as well as Tables 1-8, below. 

These sequences, as well as the molecular weights of the polypeptides, are effective to 
10 uniquely characterize the 8 proteins which form the SA-17S complex. In this respect, the invention 
includes the following substantially purified polypeptides: (i) SA-17S p71 polypeptide, having a 
molecular weight of about 71 kD and characterized by at least one sequence selected from the group 
consisting of SEQ ID NO:ll, SEQ ID NO: 12 and SEQ ID NO: 13: (ii) SA-17S p79 polypeptide, 
having a molecular weight of about 79 kD and characterized by at least one sequence selected from 
15 the group consisting of SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID 
NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22 and SEQ ID NO:23; (iii) 
SA-17S p84 polypeptide, having a molecular weight of about 84 kD and characterized by at least 
one sequence selected from the group consisting of SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, and 
20 SEQ ID NO:32; (iv) SA-17S p96 polypeptide, having a molecular weight of about 96 kD and 
characterized by at least one sequence selected from the group consisting of SEQ ID NO:33, SEQ 
ID NO:34, SEQ ID NO:35, SEQ ID NO:36 and SEQ ID NO:37; (v) SA-17S pl02 polypeptide, 
having a molecular weight of about 102 kD and characterized by at least one sequence selected from 
the group consisting of SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID 
25 NO:42, SEQ ID NO:43, and SEQ ID NO:44; and (v) SA-17S pl06 polypeptide, having a molecular 
weight of about 106 kD and characterized by at least one sequence selected from the group 
consisting of SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49. 
SEQ ID NO:50, SEQ ID NO:51, and SEQ ID NO:52. 

These peptides are useful for the formation of a complete SA-17S complex, which can be 
30 employed, e.g. , in screens such as the screen described below. The peptides may also be used to 
identify the site of interaction between the SA-17S and SC complexes, enabling the design of a 
simpler screen for compounds which modulate such an interaction. 
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A. Isolation of Full-length Clones Encoding S A-17S Polypeptides 
The peptide sequences provided in Tables 1-8, below, were used to isolate DNA or cDNA 
clones encoding the complete proteins, according to standard techniques. From these studies, 
polynucleotide sequences which encode full-length SA-17S polypeptides P 71, p79, P 84, P 96, and 
5 pl02. Based on amino acid sequence similarities, six of the eight components of the mammalian 
SA-17S complex have counterparts in the 834 kDa yeast Sec6/8/15 complex (TerBush and Novick, 

1995; TerBush, etal., 1996). 

The SA-17S p71 cDNA (SEQ ID NO:53) has an open reading frame (ORF) which encodes a 
708 amino acid polypeptide (SEQ ID NO:54) with a calculated molecular weight of 82 kdal. The 
10 P 71 polypeptide is homologous to yeast seclO and is thus denoted rseclO. The amino acid sequence 
contains potential coiled-coil domains at the N-terminus and near the C-terminus of the protein. In 
addition, rseclO is homologous with the tail domain of yeast type II myosin, and may interact with 
cytoskeletal proteins. 

The SA-17S p79 cDNA (SEQ ID N0.55) has an ORF which encodes a 653 amino acid 
15 protein (SEQ ID NO:56) with a calculated molecular weight of 75 kDa. The p79 polypeptide 

sequence shows 23% identity and 33% similarity to yeast exo70 and is thus denoted rexo70. Using 
the Coils 2.1 program, a region of 30 amino acids (residues 5-34) at the amino terminus of the 
protein is predicted to have a high probability (between 0.88 and 1.0) of forming a coiled-coil 
domain, a structure which is known to be involved in protein-protein interactions. 
20 The SA-17S p96 cDNA (SEQ ID NO:59) has an ORF which encodes an 822 amino acid 

protein (SEQ ID NO:60) with a calculated molecular weight of 95 kDa. The p96 protein sequence 
shows 24% identity and 36% similarity to yeast secl5 and is designated rsecl5. A region of 22 
amino acids (residues 85-106) at the amino terminus of the protein has a probability of greater than 
0.59 to form a coiled-coil domain. 
25 The SA-17S pl02 cDNA (SEQ ID NO:61) has an ORF which encodes a 924 amino acid 

protein (SEQ ID NO:62) with a calculated molecular weight of 104 kDa. The pl02 protein 
sequence shows 21% identity and 32% similarity to yeast sec5, and is denoted rsec5. A region of 
21 amino acids (residues 240-260) at the amino terminus of the protein has a high probability 
(greater than 0.94) to form a coiled-coil domain. 
30 The cDNA sequence of the SA-17S P 84 subunit (SEQ ID NO:57) was identified based on 

overlapping rat cDNA clones which encode the peptide sequences derived from the p84 subunit 
(SEQ ID NO:24-32). The predicted protein (SEQ ID NO:58) is designated rexo84 since it 
corresponds to the 84 kDa protein in the complex. Like other proteins of the SA-17S complex, 
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rexo84 contains regions predicted to form coiled-coils. The rexo84 sequence is not significantly 
similar to any of the known components of the 834 kDa yeast Sec6/8/15 complex. 

All the above-identified members of the mammalian SA-17S complex, as well as rsec6 and 
rsec8, contain regions predicted to form coiled-coil domains, suggesting that subunits in this 
5 complex interact with each other at least in part via these domains. Predicted coiled-coil domains 
are prominent in other membrane trafficking proteins including the SNAREs, suggesting these 
interactions may be involved in inter-complex interactions as well. 



B. Recombinant Production of the SA-17S Proteins 
10 Polynucleotide sequences encoding proteins of the SA-17S complex may be cloned into an 

expression plasmid, such as a p-GEX plasmid (Pharmacia), to produce corresponding polypeptides. 

Recombinant pGEX plasmids can be transformed into appropriate strains of E. coli and fusion 

protein production can be induced by the addition of IPTG (isopropyl-thio galactopyranoside). 

Solubilized recombinant fusion protein can then be purified from cell lysates of the induced cultures 
15 using glutathione agarose affinity chromatography according to standard methods (Ausubel, a ai, 

1992). 

Affinity chromatography may also be employed for isolating 0-galactosidase fusion pro'eins 
(such as those produced by lambda gtll clones). The fused protein is isolated by passing cell lysis 
material over a solid support having surface-bound anti-/3-galactosidase antibody. 

20 Isolated recombinant polypeptides produced as described above may be purified by standard 

protein purification procedures. These procedures may include differential precipitation, molecular 
sieve chromatography, ion-exchange chromatography, isoelectric focusing, gel electrophoresis and 
affinity chromatography. 

In addition to recombinant methods, SA-17S complex proteins or polypeptides can be 

25 isolated from selected cells by affinity-based methods, such as by using appropriate antibodies 

(described below). Further, SA-17S complex peptides may be chemically synthesized using methods 
known to those skilled in the art. 



C. Antibodies Directed Agains t SA-17S Polypeptides 
30 SA-17S polypeptides of the present invention, such as polypeptides identified by sequences 

SEQ ID NO:5-52, as well as full-length polypeptides corresponding to one of the eight protein 
components of SA-17S, such as polypeptides identified by sequences SEQ ID NO:2, 4, 54, 56, 58, 
60, and 62, and fragments thereof, may be used in the generation of antibodies, e.g., as described i: 
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the Materials and Methods below. The polypeptides may be used in unmodified form, or they may 
be coupled to appropriate carrier molecules, such as bovine serum albumin (BSA) or Keyhole 
Limpet Hemocyanin (KLH) (available from, for example, Pierce, Rockford, IL). 

To prepare antibodies, a host animal, such as a mouse or rabbit, is typically immunized with 
5 the purified polypeptide, or polypeptide coupled to carrier or fusion protein (e.g., a His-tagged 
fusion protein or a glutathione-S-transferase fusion). The host serum or plasma is collected 
following an appropriate time interval, and the serum is tested for antibodies specific against the 
polypeptide. 

The gamma globulin fraction or the IgG antibodies of immunized animals can be obtained, 

10 for example, by use of saturated ammonium sulfate precipitation or DEAE Sephadex 

chromatography, affinity chromatography, or other techniques known to those skilled in the art for 
producing polyclonal antibodies. 

Alternatively, purified antigenic polypeptide or fused antigen protein may be used for 
producing monoclonal antibodies. In this case, the spleen or lymphocytes from an immunized 

15 animal are removed and immortalized or used to prepare hybridomas by methods known to those 
skilled in the art (e.g., Harlow, et al, 1988). For example, hybridomas may be prepared by fusion 
of NS-1 mouse myeloma cells with spleen cells from BAIB/c mice immunized with His-tagged 
fusion protein (Lane, et al, 1986). Antibodies secreted by the immortalized cells are screened 
(e.g., using enzyme linked immunosorbent assay (ELISA) or a Western blot) to determine the clones 

20 that secrete antibodies of the desired specificity (e.g., Ausubel, et al., 1992). 

If desired, the antibodies may be affinity-purified prior to use, using methods known in the 
art (e.g., Harlow, et al, 1988). Antibodies or FAb fragments thereof generated as described above 
may be used in a variety of ways. In particular, they can be used to detect or quantitate the level of 
SA-17S polypeptides in any of the methods of the present invention where such detection is 

25 contemplated, e.g., in methods employing Western-based detection approaches. Further, the 

antibodies can be used in screens of expression libraries, as described above. The antibodies may 
also be used to co-immunoprecipitate proteins which interact with the SA-17S complex, as described 
in the Materials and Methods and Example 7. 



30 IV. Expression Pattern of the SA-17S Complex 

Expression patterns of the SA-17S complex are described below in Examples 4-6. 
Peripheral rsec8 staining in synaptic terminals suggests that the SA-17S complex is associated w 
the synaptic plasma membrane. The presence of the SA-17S complex in hippocampal synapses, 
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together with its association with the synaptosomal protein syntaxin, indicate that the SA-17S 
complex plays a critical role in the synaptic vesicle trafficking pathway of yeast and mammals. 

V. Role of the SA-17S Complex in Secretion 

The SA-17S complex is required for normal secretion and synaptic release, and therefor, 
compounds which inhibit the association of the SA-17S complex with syntaxin or a syntaxin- 
containing complex will inhibit the release of the secretory vesicles. 

The results of the experiments described herein, including (i) the demonstration of an in vivo 
interaction between rsec8 and syntaxin as revealed by the immunoprecipitation of syntaxin with 
anti-rsec8 antibodies, and (ii) selective staining of neurons with anti-rsec8 antibodies in the area near 
the plasma membrane of the nerve terminals (the site of exocytosis), are in agreement with studies of 
yeast secretory mutants. The yeast studies have revealed genetic interactions between Sec8/Secl5 
and the yeast homologs of syntaxin and SNAP25, SSOl and Sec9. Specifically, overexpression of 
SSOl can suppress mutations in the SEC 15 gene (Aalto, et a/., 1993), while over-expression of Sec9 
can suppress mutations in both the SEC8 and SEC 15 genes (Brennwald and Novick, 1993). 
Additionally, staining of the Sec6/8/15 complex in yeast appears enhanced at the site of vesicle 
docking and fusioh in the tip of the bud. 

The data presented here are also consistent with a role for the SA-17S complex in a Rab 
mediated event at the plasma membrane, possibly intervening between the Rab and the 7S complex 
(Bowser, etaL % 1992). 

Expression of the SA-17S complex is not as high in neurons and endocrine cells as some of 
the isoforms of vesicle trafficking proteins involved in the synaptic vesicle docking and fusion 
pathway. Northern and Western blot analyses as well as fluorescence microscopy studies detect its 
expression in all tissues and cultured cell lines examined. This ubiquitous distribution of the SA-17S 
complex suggests that it is important in both constitutive and regulated membrane trafficking and 
that its function is fundamental to the exocytotic process. 

Although the SA-17S complex is described herein as pertaining primarily to neurotransmitter 
release, it is contemplated that the SA-17S complex is involved in secretory release in a number of 
other cell types, including mast cells. 

VL Screens for Inhibitors of Synaptic Protein Interactions 

The present invention includes methods of screening for compounds effective to modulate 
vesicular release involved in synaptic transmission and other secretory processes. The methods are 
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used to identify a compound capable of affecting binding of a secretion associated 17S (SA-17S) 
complex to a syntaxin-containing (SC) complex. An SA-17S complex is contacted with an SC 
complex, in the presence and absence of a test compound. The effect of the test compound on the 
extent of binding between the SA-17S and the SC complexes is measured, and a compound is 
5 identified as effective if its effect on the extent of binding is above a threshold level (e.g., a several- 
fold difference in binding level between control and experimental samples). 

The test compound may be effective to enhance (potentiate) or inhibit binding between the 
SA-17S and SC complexes. Compounds tested may include small molecules in a small molecule 
combinatorial library, peptides in a peptide combinatorial library, and the like. 
10 Compounds which affect the binding of the SA-17S and SC complexes to one another, when 

applied to target cells, are expected to modulate vesicular release by those cells. The modulation 
may be an inhibition of release or stimulation of release, either when the compound is applied alone, 
or when the compound is applied in conjunction with another compound having an effect on 
vesicular release. 

15 In one embodiment, the assay is conducted using a co-immunoprecipitation method such as 

is described in the Materials and Methods and Example 7. Anti-rsec8 monoclonal antibodies (such 
as antibodies 2E9 and/or 17A10) or mouse immunoglobulin are coupled to protein G beads at a final 
concentration of 2 mg/ml using the cross-linker dimethylpimelimidate (Pevsner, et al., 1994a). Rat 
brain membranes prepared as described herein are resuspended in 20 mM Tris, pH 8.0, 150 mM 

20 NaCl and solubiiized with either 1 % CHAPS or Triton X-100 at a final protein concentration of 2 
mg/ml. 

The solubiiized sample is incubated at 4°C for 30 min before centrifuging at 20,000 xg for 
20 min. Following centrifugation, the solubiiized brain membranes are pre-cleared by incubation 
with protein G beads (1 ml solubiiized membranes per 200 p\ protein G beads) for 30 min at 4°C. 
25 The pre-cleared supernatant is then incubated with (i) a selected concentration of the test compound 
and (ii) 20 pi protein G beads with either immobilized anti-rsec8 monoclonal antibodies 2E9/17A10 
or immobilized control mouse Ig for 4 hrs at 4°C. The beads are then washed three times with 300 
id 20 mM Tris, pH 8.0, 150 mM NaCl containing either 0.7% CHAPS or Triton X-100. Proteins 
bound to the beads are analyzed by Western blotting for the presence of syntaxin as described in the 
30 Materials and Methods and Examples below. 

If the test compound is effective to inhibit binding of an SC complex to the immobilized SA- 
17S complex, the amount of syntaxin detected in the Western blot will be diminished. Conversely, 
if the test compound is effective to enhance the binding of an SC complex to the immobilized SA- 
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IIS complex, the amount of syntaxin detected in the Western blot will be increased. The assay is 
typically conducted with both negative controls (e.g., beads coated with control mouse Ig) and 
positive controls (no test compound in assays with anti-rsec8 antibody). 

It will be appreciated that the co-immunoprecipitation assay may be practiced using other 
5 antibodies which recognize and can immunoprecipitate the SA-17S complex, such as anti-rsec6 
antibodies or antibodies directed against one or more of the other proteins in the complex. 

A. Measuring the Effect of a Test Compound on the Extent of Binding 

The type of measurement used to quantify the effect of a test compound on the extent of 
10 binding between the SA-17S complex and a SC complex depends on the type of screening assay and 
detection system used, and can be readily determined by one of skill in the art. 

For example, in a co-immunoprecipitation screen employing a Western blot detection, such 
as described above, the extent of binding may be measured using densitometry of the Western blot 
image. The densitometry values are typically normalized, and a threshold level is set based on the 
15 amount of variation in the signal between a series of "control" samples (samples not containing test 
compounds). The smaller the variation, the smaller the effect of a test compound that can be 
reliably detected. The threshold is typically set at a several-fold difference, such as a 3-5 fold 
increase or decrease in binding affinity. 

The precise threshold level used in a particular application of the invention is determined in 
20 view of the specific requirements of that particular application. For example, if it is desired to 

isolate only compounds with a very high activity, the threshold is set to a relatively high value, such 
as a 10 to 100-fold difference in binding affinity. If, on the other hand, it is desired to isolate 
compounds having a subtle effect on the binding, a lower threshold level may employed. 

It is also appreciated that, as with any other compound screening assay, the effect of a 
25 particular compound on the binding of a SA-17S complex to an SC complex depends on the 

concentration of the compound. At relatively high compound concentrations the effect may be large, 
and be manifested as, e.g., a 50-fold difference in binding between control and experimental 
samples, whereas at lower compound concentrations, the effect may be smaller. 

30 B. Effects of Identified Compounds on Vesicular Release 

Compounds identified by a screen such as one described above as affecting the binding of an 
SA-17S complex to an SC complex may be further evaluated for their ability to modulate vesicular 
release in vitro and in vivo. 
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For example, the compounds may be tested using the PC12 cell D/3H vesicular release assay 
(Bennett, et a/., 1993), which detects a membrane-associated form of the enzyme dopamine 0- 
hydroxylase (D0H) on the luminal side of catecholamine-containing granules. When the cells are 
depolarized in the presence of calcium, granule fusion with the plasma membrane results in the 

5 exposure of D/3H on the cell surface, where it can be quantitatively detected by immunofluorescence 
microscopy (Elferink, et ai, 1993). By treating a sample of cells with a compound identified as 
affecting the binding of an SA-17S complex to an SC complex, depolarizing the cells (e.g., with a 
pulse of KCi) in the presence of calcium, and comparing the response to that obtained with an 
untreated sample of cells, the effects of the compound on vesicle release in PC12 cells may be 

10 assessed. Similar assays may be employed using freshly-isolated cells (e.g., in brain slices), or 
suitable animal models. 

VII. Compounds Suitable for Screening 

A variety of different compounds may be screened using methods of the present invention. 
15 They include peptides, macromolecules, small molecules, chemical and/or biological mixtures, and 
fungal, bacterial, or algal extracts. Such compounds, or molecules, may be either biological, 
synthetic organic, or even inorganic con pounds, and may be obtained from a number of sources, 
including pharmaceutical companies and specialty suppliers of libraries (e.g., combinatorial libraries) 
of compounds. 

20 A set of potentially-effective test peptides can be generated from overlapping peptides 

spanning the entire sequence of each of the proteins involved in the SA-17S complex/SC complex 
interaction. Such a set is likely to contain peptides which may be effective to disrupt the interactions 
of the SA-17S complex with the SC complex. 

In cases where an identified active compound is a peptide, the peptide may be utilized to aid 

25 in the discovery of orally-active small molecule mimetics. 

VIIL Applications 

Inhibitory compounds isolated using methods of the present invention may be employed to 
inhibit vesicle-mediated secretion of molecules from cells. Similarly, compounds which enhance or 
30 potentiate the binding of SA-17S complex to syntaxin or an SC complex may be used to upregulate 
vesicle-mediated secretion. The ability to modulate secretion processes has utility in a variety of 
areas, some of which are identified below. 
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A. CNS Disease Applications 

A number of disorders and/or conditions of the central nervous system (CNS) may be 
alleviated by selectively enhancing or inhibiting vesicular release in specific areas of the brain. They 
include affective disorders (e.g., depression), disorders of thought (e.g., schizophrenia) and 
5 degenerative disorders (e.g., Parkinson's disease), as well as applications such as anesthesia. A 
variety of drugs are currently used to treat such disorders and/or conditions. Compounds identified 
by methods of the present invention may be used either alone, or in combination with currently used 
therapies to alleviate symptoms associated with the disorders. 

Drugs used to treat affective disorders, which include depression, manic-depressive disorders 
10 and anxiety disorders, typically fall into three classes: (i) monoamine oxidase (MAO) inhibitors, 
such as phenelzine, (ii) tricyclic compounds, such as imipramine, and (iii) serotonin uptake blockers, 
such as fluoxetine and trazodone. All of these drugs work, at least in part, by increasing the 
concentration of either serotonin or biogenic amine neurotransmitters in CNS synapses of treated 
individuals. According to methods of the present invention, compounds which enhance or stimulate 
15 the release of serotonin or biogenic amines at selected brain synapses may be similarly effective at 
treating affective disorders. Such compounds may be identified by screening for compounds which 
eriance the binding of SA-17S or its components with an SC complex. 

Disorders of thought, such as schizophrenia, have been treated with a variety of 
antipsychotic drugs (including phenothiazines, such as chlorpromazine, butyrphenones, such as 
20 haloperidol, xithioxanthenes, and newer drugs, such as clozapine) now known to act as blockers of 
dopamine receptors. According to the teachings presented herein, compounds identified as inhibitors 
of release of dopamine-containing vesicles, particularly vesicles released from cells having their cell 
bodies in the arcuate nucleus of the hypothalamus, the substantia nigra, or the ventral tegmental 
area, may be employed to relieve symptoms of schizophrenia. Such compounds may be identified 
25 using a screening assay such as described above, for compounds effective to inhibit the binding of 
SA-17S complex or its components with an SC complex. 

Neurodegenerative diseases, such as Parkinson's disease and Huntington's disease, may also 
benefit from compounds identified according to the methods of the present invention. Parkinson's 
disease arises due to degeneration of the nigrostriatal pathway, raphaei nuclei, locus ceruleus, and 
30 motor nucleus of vagus, which result in a reduction of dopamine, serotonin and norepinephrine 
levels. The symptoms of Parkinson's may be alleviated by administering compounds identified 
according to the teachings presented herein as stimulating release of vesicles containing the above 
neurotransmitters. 
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B. Other Applications 

In addition to applications in the CNS, compounds identified employing methods of the 
present invention may be used to therapeutically intervene in a variety of other systems. They 
include the endocrine system for treatment of hormonal imbalances, the immune system for 
5 intervention in antigen processing, secreted immunomodulators, and viral processing, as well as anti- 
tumor applications, such as regulation of membrane trafficking during rapid cell division. 

The following examples illustrate but in no way are intended to limit the present invention. 

Materials and Methods 

10 Unless otherwise indicated, [ 125 I]goat anti-rabbit secondary antisera and the Enhanced 

Chemo-Luminescence (ECL) system were obtained from Amersham Corp. (Arlington, Heights, IL). 
Nitrocellulose paper was obtained from Schleicher and Schuell (Keene, NH). Materials for SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) were obtained from Bio-Rad Laboratories 
(Hercules, CA). Lys C and Glu C were obtained from Waco. Other chemicals were purchased 

15 from Sigma (St. Louis, MO) or United States Biochemical (Cleveland, OH). 

Anti-rsec8 monoclonal antibodies (mAbs) may be generated as described herein, or may be 
purchased from commercial sources (e.g., Transduction Laboratories, Lexington, KY). 

All protein and protein complex purification procedures were carried out at 4°C unless 
otherwise noted. 

20 

A. Glycerol Gradient Analysis 

Rat brain supernatant was fractionated on a linear glycerol gradient (Ting, ex al., 1995) and 
analyzed by Western blotting. Rat brain supernatant was prepared by homogenization in 6 ml of 40 
mM Hepes (pH 7.4), 150 mM NaCl, 10% (vol/vol) glycerol, 1 mM EDTA, 1 mM dithiothreitol 

25 containing leupeptin, aprotinin, pepstatin, and phenylmethanesulfonyl fluoride each at 1 /xg/ml. The 
homogenate was centrifuged at 1000 xg for 5 min and the supernatant was centrifuged at 100,000 
xg for 20 min. Eighty microliters (40 /ig of protein) of the resulting supernatant was layered onto a 
1.2-ml ten-step 22.5-36% (vol/vol) glycerol gradient in 40 mM Hepes (pH 7.4), 150 mM NaCl, 1 
mM dithiothreitol. The gradient was centrifuged at 91,000 xg for 16 hours at 4°C. 

30 Fractions (95 fi\) were collected and analyzed by SDS/PAGE and Western blotting. 

Migration of brain rsec8, rsec6 and n-Secl was monitored by Western blotting using affinity-purified 
antibodies. 
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B. Purification of Rat Brain SA-17S Complex 

1. Bio-Gel HT Hvdroxvapatite Chromatography . Fifty frozen rat brains 
(Harlan, IN) were homogenized in 400 ml of homogenization buffer (20 mM Hepes, pH 7.4, 200 
mM NaCl, 1 mM dithiothreitol (DTT), 0.1 mM EDTA and 0.3 mM PMSF) by eight up-and-down 

5 strokes with a Teflon/glass homogenizes A postnuclear supernatant was obtained by centrifuging 
the homogenate at 5000 rpm in a JA-10 rotor (Beckman, CA) for 15 min. Soluble brain protein 
supernatant was obtained by centrifuging the postnuclear supernatant at 36,000 rpm in a Ti45 rotor 
(Beckman, CA) for one hour. 

Following the centrifiigation, a final concentration of 110 mM sodium phosphate, pH 7.4 

10 was added to the supernatant before it was applied to a 25 ml hydroxyapatite (BioRad, CA) column 
equilibrated with the homogenization buffer containing 110 mM sodium phosphate at a flow rate of 
20 ml/hr at 18°C. The column was washed with 75 ml of 0.15 M sodium phosphate in buffer A 
(0.15 M NaCl, and 1 mM DTT, pH 7.4) and eluted stepwise with 5 ml aliquots of 0.2M to 0.65 M 
phosphate in buffer A, with 50 mM phosphate increment per step. Fractions containing rsec6 and 

15 rsec8 as determined by Western blot analyzes were pooled and dialyzed twice against 2 liters of 20 
mM Tris, pH 8.0, 100 mM NaCl, 0.2 mM EDTA, and 0.5 mM DTT overnight. 

2. Fractogel EMD TMAE-650 (S) Chromatography . The dialyzed 
hydroxyapatite eluant was diluted with an equal volume of 20 mM Tris, pH 8.0 and loaded onto a 3 

20 ml column of Fractogel TMAE anion exchange resin (EM Separations Technology, NJ) equilibrated 
with 20 mM Tris, pH 8.0, 50 mM NaCl, and 1 mM DTT at a flow rate of 15 ml/hr. The column 
was washed with 20 mM Tris, pH 8.0, 50 mM NaCl, 1 mM DTT and eluted with a linear gradient 
of 0 to 350 mM NaCl in 20 mM Tris, pH 8.0 and 1 mM DTT. 

25 3. HW-55S gel Filtration Chromatography . TMAE eluant fractions enriched in 

rsec6 and rsec8 were pooled and fractionated over a 180 ml HW-55S gel filtration column 
(TosoHaas, PA) equilibrated in 20 mM Hepes, pH 7.4, 200 mM NaCl, and 1 mM DTT at a flow 
rate of 5 ml/hr. 

30 4. Second Fractoeel EMD TMAE-650 (S) Chromatography . rsec6 and 

rsec8-enriched fractions from HW-55S gel filtration column chromatography were pooled and 
diluted to a final buffer concentration of 20 mM Tris, pH 8.0 and 50 mM NaCl. The diluted 
sample was applied to and eluted from a 0.5 ml Fractogel TMAE-650 column as described above. 
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Fractions containing purified SA-17S complex were pooled and used for SDS-PAGE analyzes and 
peptide sequencing. 

C. Mass Spectrometry and Peptide Sequence Analysis 

5 Purified SA-17S complex was fractionated on an 8% SDS polyacrylamide gel. Individual 

protein bands were cut out and subjected to in-gel proteolysis by Lys C, trypsin and/or Glu C 
essentially as described by Stone and Williams (1993). Excised gel slices were dried by Speed-Vac, 
and 500 /xl of 0.1 M NH 4 HC0 3 was added to each gel slice. The slices were shaken at room 
temperature for at least 4 hours. The wash from each gel slice was removed and saved. An 

10 additional 500 ^1 of 0.1 M NH 4 HC0 3 was added to each gel slice, incubated overnight with 

shaking, and the wash removed and saved. A final 150 pi of 0.1 M NH 4 HC0 3 was added to each 
gel slice, plus 5 /d of 45 mM dithiothreitol. The gel slices were incubated at 50°C for 20 min., and 
cooled to room temperature. After addition of 5 pi 100 mM iodoacetamide, the slices were 
incubated in the dark at room temperature for 20 min. Proteolytic enzyme (Lys C, trypsin and/or 

15 Glu C) was added in a 1:25 weight-to-weight ratio of enzyme:protein (with a typical gel slice 

containing 8-10 /xg of protein). After incubating the gel slices overnight at 37°C, 500 /il of O.l M 
NH 4 HC0 3 was added. The gel slices were shaken for 8 hr to extract the peptides. The supernatant 
was transferred to a tube, and 300 pi 2M urea in O.l M NH 4 HC0 3 was added to each gel slice. 
After 24 hr, the supernatant was transferred to the corresponding tube. The combined supernatants 

20 were dried in a Speed-Vac and redissolved in 210 id H 2 0. The digested peptides were fractionated 
by HPLC (Applied Biosystems). Selected peptide peaks were then subjected to amino acid 
sequencing by Edman degradation method (Applied Biosystems, CA). Following digestion, some 
peptides from pl06, p96 and p79 were also fractionated on a 1090 HPLC (Hewlett-Packard). 
Selected HPLC fractions were subjected to MALDI-TOF mass analysis (Hewlett-Packard). 

25 Following mass spectrometric analysis, selected peptide fractions were subjected to amino acid 
sequencing (Hewlett Packard). 

D. Brain Regional and Tissue Culture Cell Western Bl ot Analysis 
For brain regional Western blot analysis, various rat brain regions were dissected from fresh 
30 brain and sonicated in 10 mM Hepes, pH 7.4, 2.5 mM KOAc, 1 mM MgCl 2 , 0.1 mM EGTA, and 
0.3 mM PMSF. The homogenates were centrifiiged at 10,000 xg for 5 min and the postnuclear 
supernatants were collected. 
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For Western blot analysis of the rsec8 distribution in various cultured cells, tissue culture 
cells were grown to confluency in 10 cm petri dishes, scraped, and dounce homogenized in 20 mM 
Hepes, pH 7.4. The homogenates were centrifuged at 5000 xg for 5 min and the supernatants were 
collected for Western blot analysis. 

5 Protein samples were electrophoresed on 12.5% resolving SDS-polyacrylamide (denaturing) 

gels, transferred to nitrocellulose paper (0.2 jtM), and probed as described below. Proteins were 
visualized by enhanced chemiluminescence (ECL) system (Amersham Corp., Arlington, Heights, IL) 
and/or by autoradiography using 125 I-labeled goat anti-rabbit antibodies as the secondary antibody. 
Protein bands were quantitated by densitometry and/or phosphorimaging (Molecular Dynamics, 

10 Sunnyvale, CA). 

E. Subcellular Localization of SA-17S Complex 

For brain subcellular fractionation studies, postnuclear supernatant and crude synaptosomal 
membrane fractions were prepared as described (Bennett, et al. y 1992). Soluble brain proteins and 

15 brain membranes were obtained by centrifuging the postnuclear supernatant or lysed crude 
synaptosomal fraction at 100,000 xg for 20 min in a TLS 55 rotor (Beckman, CA) at 4°C. All 
protein samples were subjected to SDS-PAGE and Western blot analysis. 

For brain membrane extraction studies, lysed brain membranes resuspended in 20 mM Tris, 
pH 8.0, 1 mM DTT and 10% sorbitol (2 mg/ml) were incubated with equal volumes of either 20 

20 mM Tris, pH 8.0, 3 M NaCl in 20 mM Tris, pH 8.0, 8% Triton X-100 in 20 mM Tris, pH 8.0, or 
8 M urea in 20 mM Tris, pH 8.0. The incubations were carried out at 4°C for 1 hr. Following the 
incubation, the membranes were pelleted at 50,000 xg. The resulting supernatants and membrane 
pellets were analyzed for their rsec8 content by Western blot. 

The amount of rsec8 extracted from membranes were quantitated by phosphorimaging using 

25 I25 I-labeled secondary antibodies. The pixel values for 20 mM Tris extraction were 6713 for 

supernatant and 173171 for pellet; for 1.5 M NaCl extraction were 39189 for supernatant and 45414 
for pellet; for Triton X-100 extraction were 56302 for supernatant and 37254 for pellet; and for urea 
extraction were 71419 for supernatant and 9999 for pellet. 

For sucrose gradient flotation analysis of SA-17S complex, lysed crude synaptosomal 

30 membranes were resuspended in 55% sucrose in gradient buffer (20 mM Hepes, pH 7.4, 150 mM 
NaCl and 1 mM DTT) at a protein concentration of 4 mg/ml. The resuspended membranes were 
placed at the bottom of a linear 25-52.5% sucrose gradient. After centrifiigation at 42,000 rpm for 
16 hrs in a TLS 55 rotor (Beckman, CA), the gradient was fractionated and analyzed by Western 
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blot for rsec6, rsec8, Na/K ATPase (a plasma membrane marker) and synaptophysin (a synaptic 
vesicle marker). 

F. Immunoprecipitation Studies 

5 1. Immunoprecipitation of the SA-17S Complex . Immunoaffmity purified 

anti-rsec8 antibodies (Ting, et ai, 1995) or purified rabbit immunoglobulin were coupled to protein 
A beads at a final concentration of 0.5 mg/ml as previously described (Pevsner, et al y 1994a; 
Pevsner, et ai, 1994b). Three ml of rat brain supernatant (Ting, et ai, 1995) was precipitated with 
5% polyethylene glycol 3000 at 4°C and centrifuged at 20,000 xg for 10 min. The pellet was 

10 dissolved in 1 ml 20 mM Tris, pH 8.0 and 50 mM NaCl and pre-cleared by incubation with 200 fi\ 
protein A beads for 4 hrs at 4°C. Following incubation, 500 /xl of pre-cleared brain supernatant was / 
incubated with 30 fi\ of either anti-rsec8 antibody or rabbit immunoglobulin coupled to protein A 
beads overnight at 4°C. The beads were then washed three times with 200 20 mM Tris, pH 8.0, 
150 mM NaCl and 0.05% Tween 20. Proteins bound to the beads were solubilized in protein 

15 sample buffer, and subjected to Western blot analysis. 

2. Immunoprecipitation of Synaptic Vesicles . Four frozen rat brains were 
powderized in liquid nitrogen using a Waring blender. The powderized brain was then homogenized 
in 20 ml of 0.3 M sucrose and 10 mM Hepes, pH 7.5 with a Teflon-glass homogenizer. The 

20 homogenate was centrifuged at 100,000 xg for 1 hr and the supernatant was incubated for 2 hrs at 
4°C with 50 pi of anti-SV2 monoclonal antibody or purified mouse immunoglobulin coupled 
Dynabeads M-500 (Dynal, Oslo; 16 pig antibody/50 yl beads) in 0.3 M sucrose, 10 mM Hepes, pH 
7.5 and 0.15% bovine serum albumin. Following incubation, the beads were washed four times 
with 0.3 M sucrose, 10 mM Hepes, pH 7.5 and solubilized in protein sample buffer. Protein 

25 samples were subjected to SDS-PAGE and analyzed by Western blotting. 

3. Co-immunoprecipitation of svntaxin and the SA-17S complex . Anti-rsec8 
monoclonal antibodies 2E9 and 17A10 or mouse immunoglobulin were coupled to protein G beads at 
a final concentration of 2 mg/ml using the cross-linker dimethylpimelimidate as previously described 

30 (Pevsner, et a/., 1994a). Rat brain membranes prepared as described above were resuspended in 20 
mM Tris, pH 8.0, 150 mM NaCl and solubilized with either 1% CHAPS or Triton X-100 at a final 
protein concentration of 2 mg/ml. The solubilized sample was incubated at 4°C for 30 min before 
centrifuging at 20,000 xg for 20 min. Following centrifugation, the solubilized brain membranes 
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were pre-cleared by incubation with protein G beads (1 ml solubilized membranes per 200 jd protein 
G beads) for 30 min at 4°C. The pre-cleared supernatant was then incubated with 20 /*1 protein G 
beads with either immobilized anti-rsec8 monoclonal antibodies 2E9/17A10 or immobilized control 
mouse Ig for 4 hrs at 4°C. The beads were then washed three times with 300 fxl 20 mM Tris, pH 
5 8.0, 150 mM NaCl containing either 0.7% CHAPS or Triton X-100. Proteins bound to the beads 
were analyzed by Western blotting. 

G. Monoclonal and Polyclonal Antibodies 

Mouse polyclonal antibodies against rsec6 and rsec8 were generated against a His-tagged 
10 rsec6 fusion protein. The mouse serum recognizes a single band of 86 kDa in brain homogenate. 
This band is not present when pre-immune serum was used as the primary antibody. 

Mouse polyclonal antibodies against rsec8 were generated against a His-tagged rsec8 fusion 
protein. The immunized but not pre-immune mouse serum recognizes one single band at 110 kDa in 
brain homogenate. 

15 Monoclonal antibodies against rsec8, including antibodies 2E9 and 17 A 10, were obtained 

from the corresponding hybridoma cell lines generated by fusion of NS-1 mouse myeloma cells with 
spleen cells from BALB/c mice immunized with His-tagged rsec8 fusion protein (Lane, et al y 
1986). 

Rabbit polyclonal antibodies against rsec8 were generated and affinity-purified as described 
20 (Ting, etal, 1995). 

Polyclonal antibodies against nsecl were affinity purified as described (Pevsner, et ai, 

1994b). 

Rabbit polyclonal antibodies against Na/K ATPase (Mays, et al. 9 1995) was obtained from 
Dr. W. James Nelson. Monoclonal antibody against SV2 (Buckley and Kelly, 1985) was obtained 
25 from Dr. Kathy M. Buckley, and monoclonal antibody against synaptophysin was purchased from 
Boehringer Mannheim (Indianapolis, IN). 

H. Hippocampal Cell Culture and Immunocvtochemistrv 

Primary hippocampal CA3/CA1 cultures were obtained and maintained as described by 
30 Malgaroli and Tsien, 1992. Immunocytochemistry was carried out as previously described (Ting, et 
al. 9 1995) using affinity-purified anti-rsec8 mouse polyclonal antibodies at 0.01 /ig/ml and 
affinity-purified rabbit anti-synaptotagmin rabbit polyclonal antibodies at 1:1000 dilution as primary 
antibodies. The labeling was visualized using fluorescein-labeled donkey anti-mouse and 
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Cy3-labeled donkey anti-rabbit antibodies. In control experiments, identical staining procedures 
were used except that the cells were labeled using pre-immune sera or purified immunoglobulin of 
equal protein concentration as the primary antibody instead. 

5 EXAMPLE 1 

rsec6 and rsec8 are Components of the Secretion-Associated 17S (SA-17S) Complex in Brain 
To determine whether rsec6 and rsec8 function together in a protein complex, soluble brain 
proteins were fractionated on a linear glycerol gradient and the gradient fractions were analyzed by 
both sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) and Western blotting. 

10 A Coomassie blue-stained protein profile of the fractions is shown in Figure 1A. The positions of 
rsec8, rsec6 and nsecl on the gel are indicated on the right of the figure as 8, 6, and 1 respectively. 
Most of the proteins fractionated between the 4.3S and 11. 2S regions of the gradient. 

The migration of rsec6 and rsec8 in the gradient was monitored by Western blot analysis 
using affinity-purified anti-rsec6 and anti-rsec8 antibodies. The results are shown in Figure IB. 

15 Fraction 1 corresponds to the top of the gradient. Size markers (indicated by the arrows) from left 
to right are bovine serum albumin (4.3S), 0-amylase (11. 2S) and thyroglobulin (19.2S). Both rsec6 
and rsecS migrated at the 17S position. In contrast, nsecl, another synaptosomal protein, migrated 
to the 4.3S position as previously observed (Pevsner, et al, 1994b). This result indicates that all of 
the soluble rsec6 and rsec8 protein is in a large complex and that little or no protein is found in 

20 monomeric state. 

Because the Coomassie blue-stained glycerol gradient protein profile (Figure 1A) showed the 
presence of many proteins in the 17S region, it was necessary to determine whether the 
co-fractionating rsec6 and rsec8 were in the same complex. 

To address this question, rsec6 and rsec8 were immunoprecipitated from polyethylene 
25 glycol-fractionated rat brain supernatant with affinity-purified antibodies against rsec8 or with 
control rabbit immunoglobulin. The immunoprecipitated sample was analyzed for the presence of 
rsec6, rsec8 and nsecl by Western blotting. 

The results are shown in Figure 1C. Brain is the starting material; Ab represents proteins 
bound to the anti-rsec8 antibody in the absence of brain homogenate; Ab + brain represents proteins 
30 bound to the anti-rsec8 antibody following incubation with brain homogenate; and Ig + brain 
represents proteins bound to the control rabbit immunoglobulin following incubation with brain 
homogenate. The results show that rsec6, but not nsecl, co-immunoprecipitated with rsec8 from 
soluble brain proteins using affinity-purified rabbit polyclonal anti-rsec8 antibodies. No rsec6, rsec8 
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or nsecl were immunoprecipitated by control rabbit immunoglobulin, confirming the specific 
binding of the SA-17S complex to anti-rsec8 antibodies. These data demonstrate that both rsec6 and 
rsec8 are components of the same high molecular weight complex. 

Example 2 

Purification of the Mammalian SA-17S Complex 
The SA-17S complex was purified from frozen rat brain by four sequential column 
chromatographic steps to define other members of the complex. Figure 2A shows a Coomassie 
blue-stained SDS polyacrylamide gel analysis of pooled column fractions. Abbreviations are as 
follows: Lys, starting soluble brain proteins; HT, hydroxyapatite ehiant; TMAE, TMAE anion 
exchanger eluant; GF, HW55S gel filtration eluant; TMAE2, second TMAE anion exchanger eluant. 
Positions of rsec6 and rsec8 on the gel are indicated on the right as 6 and 8, respectively. Fifteen 
fig of proteins were loaded in lane Lys, HT and TMAE; eight /ig of proteins were loaded in lane 
GF; and three \ig of proteins were loaded in lane TMAE2. 

The enrichment of the SA-17S complex following each chromatographic step was monitored 
by Western blot analysis using mouse polyclonal antibodies against rsec6 and rsec8, as shown in 
Figure 2B. Initially, cytosol prepared from frozen rat brains was applied to and step-eluted from 
hydroxyapatite. No significant amount of rsec6 and rsec8 was detected in the flow-through from 
this column. The pooled hydroxyapatite eluant exhibiting an enrichment of rsec6 and rsec8 
immunoreactivity was concentrated and further purified using a strongly basic anion exchange resin, 
TMAE-Fractogel. All rsec6 and rsec8 immunoreactivity was retained by the resin and was eluted by 
a linear salt gradient. Next, taking advantage of the large size of the complex, the TMAE eluant 
was further fractionated by HW55-S gel filtration chromatography. The rsec6 and rsec8 
immunoreactivity was eluted in a single peak with an apparent native molecular weight of 650 - 700 
kDa. At this stage of purification, eight polypeptides of molecular weights between 70 kDa and 110 
kDa were visible on a SDS polyacrylamide gel stained with Coomassie blue (Figure 2A). 

The last stage of purification involved a second anion exchange chromatography utilizing 
TMAE-Fractogel. Analysis of salt-eluted fractions from this column by SDS-PAGE revealed that 
elution of eight distinct proteins with molecular weights ranging from 71 kDa to 110 kDa (Figure 
2C) coincided exactly with the elution of rsec6 and rsec8 immunoreactivity. SDS-PAGE analysis of 
pooled peak rsec6 and rsec8 immunoreactivity fractions indicated that these same eight proteins 
constituted at least 95% of the total proteins in this fraction (Figure 2A, lane TMAE2; Figure 2C). 
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Fractionation of the purified complex on a glycerol gradient demonstrated that all eight 
components co-migrate at the 17S position. In addition, all eight proteins co-immunoprecipitated 
with rsec8 monoclonal antibodies. Approximately 100-200 /xg of SA-17S complex was recovered 
from 50 rat brains after purification. Quantitation of the enrichment of the SA-17S complex by 

5 Western blot using 125 I-labeled secondary antibodies suggests that the complex constitutes 

approximately 0. 1 % of the total soluble brain protein. All eight proteins were approximately equally 
stained by Coomassie blue (although p79 reproducibly appears to stain somewhat less intensely) 
suggesting a stoichiometry of one copy of each protein within the complex (Figure 2C). This 1:1 
stoichiometry is further supported by the apparent molecular weight of 600-700 kDa of the native 

10 complex. Peptide sequences from protein bands of 110 kDa and 86 kDa indicate that they are rsec8 
and rsec6, respectively. 

Example 3 

Peptide Sequencing of the SA-17S 
15 Complex Components 

To further characterize individual components of the mammalian SA-17S complex, the 

purified complex was fractionated on an SDS polyacrylamide gel and each polypeptide band was cut 

out and individually subjected to in-gel proteolysis with trypsin, Lys C and/or Glu C. The digested 

peptides were eluted from the gel, fractionated by high performance liquid chromatography (HPLC), 

20 and in many cases, analyzed by mass spectrometry for peptide purity and mass. 

Figures 3A, 3B, 3C and 3D show the mass spectrometric analyzes of HPLC peptide peaks 

from four proteins, pl02, p96, p84 and p71, respectively. A single peptide peak in the mass 

spectrometry profile suggests the presence of one peptide species in the analyzed HPLC peptide 

peak, as shown in Figs. 3A, 3B, 3C and 3D. The multiple peaks shown in Fig. 3D are due to the 

25 binding of different amounts of salt to the peptide (amino acid sequence analysis of this peptide peak 

confirmed the presence of only one peptide species). The mass of each peptide predicted by mass 

spectrometry is shown above the peptide peak. The amino acid sequence of each peptide determined 

by peptide sequencing and its mass (protonated form of peptide) calculated from the determined 

peptide sequence is shown beside corresponding peptide peak, along with its SEQ ID NO. In Panels 

30 A to C the calculated and observed masses matched with less than 0.1% difference. 

Figures 4A-H, as well as Tables 1-8 below, show all the peptide sequences obtained from 

the eight predicted members of the SA-17S complex. Protein components of purified SA-17S 

complex were digested with a combination Trypsin (cuts after a lysine or an arginine residue), Lys 

C (cuts after a lysine residue) and/or Glu C (cuts after a glutamic acid) as described in the 
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Experimental Procedures, pi 10 or rsec8 peptides were obtained from a combination of Lys C and 
Trypsin digestion; pl06 peptides 1-6 were obtained from a combination of Lys C and Trypsin 
digestion and pl06 peptides 7-8 were obtained from Lys C digestion; pl02 peptides 1-4 were 
obtained from a combination of Lys C and trypsin digestion and pl02 peptides 5-7 were obtained 

5 from Lys C digestion; p96 peptides 1-3 were obtained from Lys C digestion and p96 peptides 4-6 
were obtained from a combination of Lys C and Glu C digestion; p86 or rsec6 peptide 1 was 
obtained from Lys C digestion and p86 or rsec6 peptides 2-3 were obtained from a combination of 
Lys C and Glu C digestion; p84 peptide 1 was obtained from Lys C digestion and p84 peptides 2-9 
were obtained from a combination of Lys C and Glu C digestion; p79 peptides 1-6 were obtained 

10 from Lys C digestion and p79 peptides 7-10 were obtained from a combination of Lys C and Glu C 
digestion; p71 peptides were obtained from Lys C digestion. 



Table 1 - rsec6 



20 



Name 


SEQ ID NO: 


Sequence 


rsec6 .1 


5 


DFRQSINTIEXL 


rsec6 .2 


6 


QGPSQASPNYXP 


rsec6 .3 


7 


AAIQSQLDGVRTGLSQ 


Table 2 - rsec8 


| Name 


SEQ ED NO: 


Sequence 


1 rsec8.1 


8 


XAPEGPLIDVXNI 


rsec8.2 


9 


EFAAFFAK 


rsec8.3 


10 


XLGVQRPLLQSTSIXE 



30 

Table 3 • SA-17S p71 



| Name 


SEQ ID NO: 


Sequence 


| P71.1 


11 


XNQVAFQHFQELDEHI 


p71.2 


12 


VCHLXDQLEXVN 


1 P71.3 


13 


LHLIAQELPFDRFSEVK 
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Tahle 4 - SA-17S d79 



Name 


SEQ ID NO: 


Sequence II 


p79.1 


14 


XTDY1AE 


p79.2 


15 


ETYGAFLSRSXG 


p79.3 


16 


XXPPQGVPYNPASP 




17 


XXQEEETLMFIRGN 


p79.5 


18 


ALFIRDDXQF 


p79.6 


19 


NLPVFQSCL 


p79.7 


20 


AVEYFQDKFPD 


p79.8 


21 


YRVEQVGDMIDRLFDTS 


p79.9 


22 


VYEDPALSAIFLHNNYNY 


p79.10 


23 


XXYGAFLHRYSSVPFVYGXH j| 




Table 5 - SA-17S d84 


Name 


SEQ ID NO: 


Sequence 


p84.1 


24 


QLSQQSDGDRDLQEWRQRVQALAEETAQYK 


p84.2 


25 


XLQLSFNFSEPNRQRP 


p84.3 


26 


SIPLALLPAAAAGA 


p84.4 


27 


DAVXQNSTQAAETEN 


p84.5 


28 


DYRNDEA 


p84.6 


29 ! 


ENNPEEDDPS 


p84.7 


30 


XLSQQSDXG 


p84.8 


31 


AAALRAPPXVTS 


p84.9 


32 


XKREPLE 
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Table 6 - SA-17S p96 



10 



Name 


SEQ ID NO: 


Sequence 


p96.1 


33 


WGQFPFQDTELEK 


p96.2 


34 


XVYEIFDN 


p96.3 


35 


DFLESIR 


p96.4 


36 


XDQDLQLADYDHMT 


p96.5 


37 


STNLLLTRTLXN 


Table 7 - SA-17S d102 


Name 


SEQ ID NO: 


Sequence 


p 102.1 


38 


YLSGLQAPGXPASQSIGAQ 


pl02.2 


39 


GGLSTFFEAQDALSAIHQK 


pl02.3 


40 


ASNTADTXRQER 


pl02.4 


41 


XRENYIEGGI 


pl02.5 


42 


ENLGRLFENYI 


pl02.6 


43 


XDYDW1NDYE 


pl02.7 


44 


XIPXLSTRPANP 


Table 8 - SA-17S d106 


Name 


SEQ ID NO: 


Sequence 


pl06.1 


45 


XXYGEIAXK 


pl06.2 


46 


XATVSLPEK 


pl06.3 


47 


XDYGVIAND 


pl06.4 


48 


LKYFFMVASVK 


pl06.5 


49 


ELPEFNLHFFK 


p!06.6 


50 


XLQDVDLASXR 


pl06.7 


51 


XNRXNEPAVNVL 


p!06.8 


52 


XQLXNIVEPEXIY 



X" represents residues whose identity has not been unambiguously determined. 
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As mentioned above, peptide sequences from pi 10 and p86 confirmed that these two proteins 
are rsec8 and rsec6 respectively. Furthermore, all of the peptide sequences from these two protein 
bands match the amino acid sequence predicted by rsec6 and rsec8 cDNAs demonstrating that the 
bands are free of contaminating proteins and that the SA-17S complex is substantially pure or 
5 substantially purified. 

Comparison of remaining peptide sequences to "GENBANK" and a database of human 
expressed sequence tag (EST) cDNAs generated by the Institute for Genomic Research (TIGR) found 
matches to two proteins (p71 and p79). Overall, the amino acid sequences match the sequences 
predicted from the cDNAs with an accuracy greater than 97%. Taken together, these results suggest 
10 that members of the SA-17S complex are novel proteins which have not been previously 
characterized. 

Example 4 
Expression Pattern of the SA-17S Complex 

15 To determine the site of SA-17S complex function in brain, the brain regional distribution of 

these proteins was examined using SDS-PAGE/Western blot analyzes as described above (10 /xg of 
proteins loaded per lane). Figure 5 A shows that rst^S is expressed in all regions of rat brain 
examined and there is no significant brain regional variation. A similar expression pattern is also 
observed for rsec6, suggesting that both proteins are required by all brain cells. 

20 To determine whether the SA-17S complex functions in multiple exocytotic pathways in 

addition to the regulated pathway underlying neurotransmission, the presence of rsec8 in cell lines 
derived from kidney, ovary, pituitary and adrenal medulla tissues was investigated with SDS- 
PAGE/Western blot analyzes as described above. The results are shown in Figure 5B. 
Affinity-purified anti-rsec8 rabbit polyclonal antibodies detected rsec8 as a 110 kDa protein in all the 

25 cell lines examined. rsec6 was also observed in all cell lines examined. The cells with the highest 
amounts of rsec8 were AtT-20 and PC-12 cells. Both of these cell lines have regulated secretory 
pathways and contain numerous secretory vesicles as well as synaptic proteins, indicating that the 
expression of the SA-17S complex may parallel the level of secretory activity in cells. 

30 Example 5 

Subcellular Localization of the SA-17S Complex 
Cell fractionation studies were carried out to determine the solubility properties of the SA-17S 
complex. Rat brain homogenate was subjected to a 1,000 xg spin to yield postnuclear supernatant 
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(PNS). The PNS was then subjected to either (i) a 100,000 xg spin to yield soluble (PNS sup) and 
membrane protein (PNS pellet) fractions or (ii) a 25,000 xg spin to yield a supernatant and a crude 
synaptosomal pellet (crude synaptosomes). The synaptosomal pellet was lysed in 2 mM Hepes, pH 
7.4 and homogenized with a Polytron. The homogenate was then subjected to a 100,000 xg spin to 

5 yield both soluble (cytosol) and membrane (memb) fractions. The membrane pellets were 

resuspended in 20 mM Hepes, pH 7.4 to volumes equal to that of the corresponding supernatants. 
Five fi\ of each protein sample was loaded per lane. 

Figure 6A shows results of a Western blot analysis for the presence of rsec6 and rsec8 in the 
subcellular fractionation of the SA-17S complex. Upon homogenization of freshly dissected rat 

10 brain, rsec6 and rsec8 were present in both soluble and insoluble fractions. Approximately 25% and 
75% of rsec6 and rsec8 were found in the supernatant and the membrane pellet respectively. 
Similarly, in crude synaptosomes, the majority of rsec6 and rsec8 was found associated with 
synaptosomal membranes. Very little rsec6 and rsec8 were found in the synaptosomal cytosol. 
However, in frozen brain more rsec6 and rsec8 (approximately 50%) was found in the soluble 

15 fraction. The presence of rsec6 and rsec8 in the membrane fraction was not due to the entrapment 
of the complex in intact cells or organelles in the postnuclear supernatant because the distribution of 
these two proteins was similar when rat brain was completely homogenized in hypotonic buffer with 
a Polytron. Likewise, the presence of these two proteins in the supernatant was not due to 
contaminating membrane fragments because both SV2, a synaptic vesicle marker, and Na/K 

20 ATPase, a plasma membrane marker, were detected only in synaptosomal membrane pellet and not 
in synaptosomal cytosol. 

To determine if the pool of the SA-17S complex that pelleted with crude synaptosomal 
membranes at 100,000 xg was due to its association with synaptosomal membranes or with large 
proteinaceous aggregates, crude synaptosomal membranes were subjected to sucrose gradient 

25 flotation. Crude synaptosomal membranes were placed at the bottom of a sucrose gradient and 
allowed to float up into the gradient. The flotation of the SA-17S complex into the gradient was 
monitored by Western blot analysis using mouse polyclonal antibodies. The migration of plasma 
membrane and synaptic vesicles was also monitored using rabbit polyclonal antibodies against Na/K 
ATPase and a monoclonal antibody against synaptophysin respectively. Fraction 1 corresponds to 

30 the top of the gradient. 

The results are shown in Figure 6B. Both rsec6 and rsec8 floated up into the sucrose 
gradient, as observed with both Na/K ATPase, and synaptophysin (a synaptic vesicle marker). Very 
little rsec6 and rsec8 were detected at the bottom of the gradient as protein aggregates. These data 
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suggest that the insoluble pool of SA-17S complex is associated with synaptosomal (brain) 
membranes. 

To investigate the nature of SA-17S complex association with membranes, brain membranes 
were extracted with 20 mM Tris buffer, 1.5 M NaCl, 4% Triton X-100, or 4 M urea. The 
5 extracted membranes were centrifiiged and the resulting supernatants and membrane pellets were 
subjected to Western blot analyzes. Lysed brain membranes were extracted with either 20 mM Tris, 
pH 8.0, 1.5 M NaCl, 4% Triton X-100 or 4 M urea. Following extraction, the membranes were 
centrifiiged to yield soluble (s) and membrane (p) fractions. The presence' of rsec8 in these fractions 
was detected and quantitated by phosphorimaging using anti-rsec8 primary and ,25 I-labeled secondary 
10 antibodies (see Materials and Methods, above). 

The results are shown in Fig. 6C. Approximately 4%, 46%, 60% and 88% of the SA-17S 
complex were extracted by Tris buffer, 1.5 M NaCl, 4% Triton X-100 and 4 M urea, respectively. 
The incomplete extraction of the SA-17S complex in the presence of a high concentration of salt 
suggests that this complex may bind very tightly to the membranes. Interestingly, a fraction of the 
15 native SA-17S complex is found in the Triton X-100 insoluble pellet following membrane 
solubilization, contrary to the observation that purified soluble SA-17S complex is completely 
soluble in Triton X-100, suggesting that the membrane-bound complex may be associated with 
cytoskeletal elements. Taken together, these observations suggest that the insoluble pool of SA-17S 
complex may be associated with cytoskeletal elements which, in turn, bind to membranes. 
20 Since rsec6 and rsec8 floated to the same region of the sucrose gradient as synaptic vesicles 

and plasma membranes, the possibility of an association between the SA-17S complex and synaptic 
vesicles was investigated. Synaptic vesicles were immunoprecipitated using an anti-SV2 (a synaptic 
vesicle-specific integral membrane protein) monoclonal antibody, or control mouse immunoglobulin, 
coupled to Dynabeads M-500. The immunoprecipitated vesicles were examined for the presence of 
25 synaptotagmin and rsec8 by Western blotting. 

Figure 6D shows one such Western blot analysis. While the synaptic vesicle-specific protein 
synaptotagmin was present in anti-SV2 antibody immunoprecipitated vesicles, no rsec8 was detected 
even upon longer exposure. Neither protein was immunoprecipitated by control mouse 
immunoglobulin indicating a specific association of synaptic vesicles with anti-SV2 antibody. These 
30 results suggest that SA-17S complex is not associated strongly or stably with synaptic vesicles 
leaving the plasma membrane as the likely site of its membrane localization. 

Example 6 
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Localization of rsec8 in Cultured 
Hi ppocampal Neurons 

To confirm the presence of the SA-17S complex in synapses, its localization in primary 

hippocampal cultures was studied by immunofluorescence microscopy. Fifteen to twenty day-old 

5 cultures of primary hippocampal neurons were fixed and immunostained with antibodies against 
rsec8 (Figs. 7A and 7B) and synaptotagmin (Fig. 7C). Fig. 7D shows a larger magnification of 
rsec8 staining overlaid with that of synaptotagmin at two synaptic terminals indicated by arrows. 
Synaptic terminals in these cultures were identified by the presence of the synaptic vesicle marker, 
synaptotagmin. Bar = 7/xm. 

10 Figure 7A shows that rsec8 labeling was detected throughout hippocampal neurons, including 

the cell bodies, dendrites, axons and nerve terminals. Brighter rsec8 staining at synapses suggests 
an accumulation of the SA-17S complex in nerve terminals (Figs. 7A and B). Although both 
synaptotagmin and rsec8 were detected in synaptic terminals, staining of rsec8 was more 
concentrated near the plasma membrane than that of synaptotagmin (Figs. 7B, 7C and 7D). Many 

15 terminals displayed rsec8 labeling in peripheral areas where it did not co-localize with the more 
centrally positioned synaptotagmin staining. 

Glial and other non-neural cells in the hippocampal neuron cultures were also stained, 
although typically more weakly than the neurons. As controls, pre-immune immunoglobulins of 
same protein concentrations as the anti-rsec8 antibodies were used as the primary antibodies. At the 

20 same time exposures, no staining was observed in these neurons using identical labeling procedures. 
Taken together, these biochemical and histochemical data suggest that the SA-17S complex is 
localized to the plasma membrane. 



Example 7 

25 Co-immunoprecipitation of rsec8 

Complex with Svntaxin 

Monoclonal antibodies were generated against rsec8 to study the properties of the SA-17S 

complex, including its membrane association in brain. The specificity of seven monoclonal 

antibodies generated against rsec8 was investigated by Western blot analysis as described above (ten 

30 fig of brain postnuclear supernatant were loaded per lane). As shown in Figure 8A, all these 

antibodies (with the exception of mAbl5E12) recognized a single band of 110 kDa in the brain 

postnuclear supernatant. Monoclonal antibody 15E12 detected a major band of 110 kDa in addition 

to two minor bands of 65 kDa and 90 kDa. These two lower molecular weight bands are likely to 

be degradation products of rsec8. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Scheller, Richard H. 

(ii) TITLE OF INVENTION: Methods and Compositions for Modulation 

of Vesicular Release 

(iii) NUMBER OF SEQUENCES: 62 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Dehlinger & Associates 
15 (B) STREET: 350 Cambridge Avenue, Suite 250 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94306 

20 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

25 (D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: 19-DEC-1997 
30 (C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: 60/033,905 

(B) FILING DATE: 20-DEC-1996 

35 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Dehlinger, Peter J. 

(B) REGISTRATION NUMBER: 28,006 

(C) REFERENCE /DOCKET NUMBER: 8600-0179.30 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (650) 324-0880 

(B) TELEFAX: (650) 324-0960 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE : NO 



15 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: rsec6 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
20 (B) LOCATION: 1..2265 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

25 ATG TGC AAA GAT TCT GCC TGC TTT TCG ACT ATG AAG GAG ACA GAC CTG 
48 

Met Cys Lys Asp Ser Ala Cys Phe Ser Thr Met Lys Glu Thr Asp Leu 
15 10 15 

30 GAG GCT GTT GCA ACA GCA GTC CAA AGG GTG GCT GGG ATG CTT CAG CGC 
96 

Glu Ala Val Ala Thr Ala Val Gin Arg Val Ala Gly Met Leu Gin Arg 
20 25 30 

35 CCA GAC CAG CTG GAC AAA GTG GAG CAG TAT CGC AGA AGG GAG GCT CGG 144 
Pro Asp Gin Leu Asp Lys Val Glu Gin Tyr Arg Arg Arg Glu Ala Arg 
35 40 45 

AAG AAG GCC TCT GTG GAG GCC AGG CTA AAG GCC GCA ATC CAG TCT CAA 192 
40 Lys Lys Ala Ser Val Glu Ala Arg Leu Lys Ala Ala He Gin Ser Gin 
50 55 60 



45 



CTA GAT GGC GTC CGC ACA GGC CTA AGC CAA CTG CAC AAT GCA CTG AAT 
Leu Asp Gly Val Arg Thr Gly Leu Ser Gin Leu His Asn Ala Leu Asn 
65 70 75 80 
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38 

GAT GTC AAG GAT ATC CAG CAG TCA CTG GCT GAT GTG AGC AAG GAC TGG 
Asp Val Lys Asp He Gin Gin Ser Leu Ala Asp Val Ser Lys Asp Trp 
85 90 95 



288 



AGG CAG AGC ATC AAC ACC ATC GAG AGT CTC AAG GAC GTA AAA GAC GCA 
Arg Gin Ser He Asn Thr He Glu Ser Leu Lys Asp Val Lys Asp Ala 
100 105 HO 



336 



GTG GTG CAG CAC AGC CAG CTG GCT GCA GCT GTG GAG AAC CTC AAG AAC 
10 Val Val Gin His Ser Gin Leu Ala Ala Ala Val Glu Asn Leu Lys Asn 
115 120 125 



384 



15 



ATA TTC TCT GTG CCT GAG ATT GTG AGG GAG ACC CAA GAT CTC ATC GAG 
lie Phe Ser Val Pro Glu He Val Arg Glu Thr Gin Asp Leu He Glu 
130 135 140 



432 



20 



CAA GGG GCT CTC CTG CAG GCC CAC CGG AAG CTA ATG GAT CTG GAG TGC 480 
Gin Gly Ala Leu Leu Gin Ala His Arg Lys Leu Met Asp Leu Glu Cys 
145 150 155 160 

TCC CGG GAT GGG CTA ATG TGT GAG CAG TAC CGC ATG GAC AGT GGG AAC 52 8 

Ser Arg Asp Gly Leu Met Cys Glu Gin Tyr Arg Met Asp Ser Gly Asn 
165 170 175 



25 AAA CGG GAC ATG ACC CTC ATT CAT GGC TAC TTC GGT AGC ACA CAG GGG 
Lys Arg Asp Met Thr Leu He His Gly Tyr Phe Gly Ser Thr Gin Gly 
180 185 190 



576 



CTC TCT GAC GAG CTG GCC AAG CAG CTG TGG ATG GTG CTG CAA AGA TCG 
30 Leu Ser Asp Glu Leu Ala Lys Gin Leu Trp Met Val Leu Gin Arg Ser 
195 200 205 



624 



CTG GTC ACT GTC CGC CGG GAT CCC ACC TTG CTG GTC TCC GTG GTC AGG 
35 Leu Val Thr Val Arg Arg Asp Pro Thr Leu Leu Val Ser Val Val Arg 
210 215 220 



672 



40 



ATC ATT GAA AGG GAA GAA AAA ATT GAC AGG CGG ATA CTT GAT CGA AAA 
He He Glu Arg Glu Glu Lys He Asp Arg Arg He Leu Asp Arg Lys 
225 230 235 240 



720 



45 



AAG CAA ACT GGG TTT GTT CCT CCT GGA AGG CCC AAA AAC TGG AAG GAA 
Lys Gin Thr Gly Phe Val Pro Pro Gly Arg Pro Lys Asn Trp Lys Glu 
245 250 255 
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39 

AAA ATG TTT GCC GTC TTG GAC AGA ACT GTG ACG ACT AGA ATC GAA GGC 816 
Lys Met Phe Ala Val Leu Asp Arg Thr Val Thr Thr Arg lie Glu Gly 
260 265 270 

ACG CAG GCG GAC ACC AGA GAA TCT GAC AAG ATG TGG CTC GTG CGC CAC 864 
Thr Gin Ala Asp Thr Arg Glu Ser Asp Lys Met Trp Leu Val Arg His 
275 280 285 

CTG GAG ATC ATT AGG AAG TAC GTC CTG GAT GAC CTC GTC ATC GCC AAG 912 
Leu Glu He He Arg Lys Tyr Val Leu Asp Asp Leu Val He Ala Lys 
290 295 300 



AAC CTG CTG GTG CAG TGC TTC CCT CCT CAC TAT GAC ATC TTT AAG AAC 960 
Asn Leu Leu Val Gin Cys Phe Pro Pro His Tyr Asp He Phe Lys Asn 
15 305 310 315 320 

CTC CTG AGC ATG TAC CAC CAG GCC TTG AGC ATT CGG ATG CAG GAC CTC 1008 
Leu Leu Ser Met Tyr His Gin Ala Leu Ser He Arg Met Gin Asp Leu 
325 330 335 

20 

GCC TCA GAG GAC CTT GAG GCC AAC GAG ATT GTG AGC CTC TTG ACC TGG 1056 
Ala Ser Glu Asp Leu Glu Ala Asn Glu lie Val Ser Leu Leu Thr Trp 
340 345 350 

25 GTC CTA AAT ACC TAC ACC AGT GCA GAG ATG ATG GGG AAT GTG GAG CTG 1104 
Val Leu Asn Thr Tyr Thr Ser Ala Glu Met Met Gly Asn Val Glu Leu 
355 360 365 



30 GCC CCA GAG GTG GAC GTT AAT GCC CTG GAA CCG CTC CTC TCA CCA AAC 1152 
Ala Pro Glu Val Asp Val Asn Ala Leu Glu Pro Leu Leu Ser Pro Asn 
370 375 380 

GTG GTC TCT GAG CTG CTC GAC ACA TAC ATG TCA ACA CTC ACG TCC AAC 1200 
35 Val Val Ser Glu Leu Leu Asp Thr Tyr Met Ser Thr Leu Thr Ser Asn 
385 390 395 400 

ATC ATT GCC TGG CTT CGG AAA GCA CTG GAG ACA GAC AAG AAA GAC TGG 1248 
He He Ala Trp Leu Arg Lys Ala Leu Glu Thr Asp Lys Lys Asp Trp 
40 405 410 415 

AGC AAA GAG ACG GAG CCG GAA GCA GAC CAG GAC GGC TAT TAT CAG ACG 1296 
Ser Lys Glu Thr Glu Pro Glu Ala Asp Gin Asp Gly Tyr Tyr Gin Thr 
420 425 430 
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ACA CTT CCT GCC ATT GTA TTC CAG ATG TTT GAA CAG AAT CTT CAA GTT 1344 
Thr Leu Pro Ala lie Val Phe Gin Met Phe Glu Gin Asn Leu Gin Val 
435 440 445 

5 GCT GCT CAA ATA AGT GAA GAT TTG AAA ACA AAG GTA TTA GTT TTG TGT 1392 
Ala Ala Gin He Ser Glu Asp Leu Lys Thr Lys Val Leu Val Leu Cys 
450 455 460 

CTT CAG CAA ATG AAT TCT TTC CTA AGT AGA TAC AAA GAG GAA GCC CAG 1440 
10 Leu Gin Gin Met Asn Ser Phe Leu Ser Arg Tyr Lys Glu Glu Ala Gin 
465 470 475 480 

CTT TAC AAA GAG GAG CAC CTG AGA AAC CGG CAG CAC CCA CAC TGT TAC 1488 
Leu Tyr Lys Glu Glu His Leu Arg Asn Arg Gin His Pro His Cys Tyr 
15 485 490 495 

GTG CAG TAC ATG GTC GCC ATC ATC AAT AAC TGC CAG ACC TTC AAA GAA 1536 
Val Gin Tyr Met Val Ala He He Asn Asn Cys Gin Thr Phe Lys Glu 
500 505 510 



20 



25 



TCC ATC ATC AGT CTG AAA AGA AAA TAC CTA AAA CCC GAA ACA GAG GAA 1584 
Ser He lie Ser Leu Lys Arg Lys Tyr Leu Lys Pro Glu Thr Glu Glu 
515 520 525 



AGC CTG TGT CAG AGT CAG CCC AGC ATG GAT GGG ATT CTA GAT GCC ATT 1632 
Ser Leu Cys Gin Ser Gin Pro Ser Met Asp Gly He Leu Asp Ala He 
530 535 540 

30 GCT AAG GAA GGC TGC AGC AGT CTG CTA GAG GAG GTC TTC CTG GAT CTA 1680 
Ala Lys Glu Gly Cys Ser Ser Leu Leu Glu Glu Val Phe Leu Asp Leu 
545 550 555 560 

GAG CAA CAT CTG AAT GAG CTA ATG ACA AAG AAG TGG ATG TTA GGA TCA 1728 
35 Glu Gin His Leu Asn Glu Leu Met Thr Lys Lys Trp Met Leu Gly Ser 

565 570 575 

AAT GCT GTG GAC ATC ATC TGT GTC ACC GTA GAA GAC TAC TTC AAT GAC 1776 
Asn Ala Val Asp He He Cys Val Thr Val Glu Asp Tyr Phe Asn Asp 
40 580 585 590 

TTC GCC AAA ATT AAA AAG CCA TAC AAA AAG AGG ATG ACA GCT GAG GCA 1824 
Phe Ala Lys He Lys Lys Pro Tyr Lys Lys Arg Met Thr Ala Glu Ala 
595 600 605 
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(A) LENGTH: 755 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Cys Lys Asp Ser Ala Cys Phe Ser Thr Met Lys Glu Thr Asp Leu 
10 1 5 10 15 

Glu Ala Val Ala Thr Ala Val Gin Arg Val Ala Gly Met Leu Gin Arg 
20 25 30 

15 Pro Asp Gin Leu Asp Lys Val Glu Gin Tyr Arg Arg Arg Glu Ala Arg 
35 40 45 

Lys Lys Ala Ser Val Glu Ala Arg Leu Lys Ala Ala He Gin Ser Gin 
50 55 60 

20 

Leu Asp Gly Val Arg Thr Gly Leu Ser Gin Leu His Asn Ala Leu Asn 
65 70 75 80 

Asp Val Lys Asp He Gin Gin Ser Leu Ala Asp Val Ser Lys Asp Trp 
25 85 90 95 

Arg Gin Ser He Asn Thr He Glu Ser Leu Lys Asp Val Lys Asp Ala 
100 105 110 

30 Val Val Gin His Ser Gin Leu Ala Ala Ala Val Glu Asn Leu Lys Asn 
115 120 125 

He Phe Ser Val Pro Glu He Val Arg Glu Thr Gin Asp Leu He Glu 
130 135 140 

35 

Gin Gly Ala Leu Leu Gin Ala His Arg Lys Leu Met Asp Leu Glu Cys 
145 150 155 160 

Ser Arg Asp Gly Leu Met Cys Glu Gin Tyr Arg Met Asp Ser Gly Asn 
40 165 170 175 

Lys Arg Asp Met Thr Leu He His Gly Tyr Phe Gly Ser Thr Gin Gly 
180 185 190 



45 



Leu Ser Asp Glu Leu Ala Lys Gin Leu Trp Met Val Leu Gin Arg Ser 
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195 



200 



205 



Leu Val Thr Val Arg Arg Asp Pro Thr Leu Leu Val Ser Val Val Arg 
210 215 220 

5 

lie He Glu Arg Glu Glu Lys He Asp Arg Arg He Leu Asp Arg Lys 
225 230 235 240 

Lys Gin Thr Gly Phe Val Pro Pro Gly Arg Pro Lys Asn Trp Lys Glu 
10 245 250 255 

Lys Met Phe Ala Val Leu Asp Arg Thr Val Thr Thr Arg He Glu Gly 
260 265 270 

15 Thr Gin Ala Asp Thr Arg Glu Ser Asp Lys Met Trp Leu Val Arg His 
275 280 285 

Leu Glu He' He Arg Lys Tyr Val Leu Asp Asp Leu Val He Ala Lys 
290 295 300 

20 

Asn Leu Leu Val Gin Cys Phe Pro Pro His Tyr Asp He Phe Lys Asn 
305 310 315 320 

Leu Leu Ser Met Tyr His Gin Ala Leu Ser He Arg Met Gin Asp Leu 
25 325 330 335 

Ala Ser Glu Asp Leu Glu Ala Asn Glu He Val Ser Leu Leu Thr Trp 
340 345 350 

30 Val Leu Asn Thr Tyr Thr Ser Ala Glu Met Met Gly Asn Val Glu Leu 
355 360 365 

Ala Pro Glu Val Asp Val Asn Ala Leu Glu Pro Leu Leu Ser Pro Asn 
370 375 380 

35 

Val Val Ser Glu Leu Leu Asp Thr Tyr Met Ser Thr Leu Thr Ser Asn 
385 390 395 400 

He He Ala Trp Leu Arg Lys Ala Leu Glu Thr Asp Lys Lys Asp Trp 
40 405 410 415 



Ser Lys Glu Thr Glu Pro Glu Ala Asp Gin Asp Gly Tyr Tyr Gin Thr 
420 425 430 



45 Thr Leu Pro Ala He Val Phe Gin Met Phe Glu Gin Asn Leu Gin Val 
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435 



440 



445 



Ala Ala Gin lie Ser Glu Asp Leu Lys Thr Lys Val Leu Val Leu Cys 
450 455 460 

5 

Leu Gin Gin Met Asn Ser Phe Leu Ser Arg Tyr Lys Glu Glu Ala Gin 
465 470 475 480 

Leu Tyr Lys Glu Glu His Leu Arg Asn Arg Gin His Pro His Cys Tyr 
10 485 490 495 

Val Gin Tyr Met Val Ala He He Asn Asn Cys Gin Thr Phe Lys Glu 
500 505 510 

15 Ser He He Ser Leu Lys Arg Lys Tyr Leu Lys Pro Glu Thr Glu Glu 
515 520 525 

Ser Leu Cys Gin Ser Gin Pro Ser Met Asp Gly He Leu Asp Ala He 
530 535 540 

20 

Ala Lys Glu Gly Cys Ser Ser Leu Leu Glu Glu Val Phe Leu Asp Leu 
545 550 555 560 

Glu Gin His Leu Asn Glu Leu Met Thr Lys Lys Trp Met Leu Gly Ser 
25 565 570 575 

Asn Ala Val Asp He He Cys Val Thr Val Glu Asp Tyr Phe Asn Asp 
580 585 590 

30 Phe Ala Lys He Lys Lys Pro Tyr Lys Lys Arg Met Thr Ala Glu Ala 
595 600 605 

His Arg Arg Val Val Val Glu Tyr Leu Arg Ala Val Met Gin Lys Arg 
610 615 620 

35 

He Ser Phe Arg Ser Ala Glu Glu Arg Lys Glu Gly Ala Glu Lys Met 
625 630 635 640 

Val Arg Glu Ala Glu Gin Leu Arg Phe Leu Phe Arg Lys Leu Ala Ser 
40 645 650 655 



Gly Phe Gly Glu Asp Ala Asp Gly His Cys Asp Thr He Val Ala Val 
660 665 670 



45 Ala Glu Val He Lys Leu Thr Asp Pro Ser Leu Leu Tyr Leu Glu Val 
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675 680 685 

Ser Thr Leu Val Ser Lys Tyr Pro Asp He Arg Asp Asp His He Gly 
690 695 700 

5 

Ala Leu Leu Ala Leu Arg Gly Asp Ala Ser Arg Asp Met Lys Gin Thr 
705 710 715 720 

He Met Glu Thr Leu Glu Gin Gly Pro Met Gin Ala Ser Pro Asn Tyr 
10 725 730 735 

Val Pro He Phe Gin Glu He Val Val Pro Ser Leu Asn Val Ala Lys 
740 745 750 

15 Leu Leu Lys 
755 



(2) INFORMATION FOR SEQ ID N0:3: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
25 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



30 



35 



40 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: rsec8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .2925 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATG GCG GCA GAA GCA GCT GGT GGG AAA TAC AGG AGC ACA GTC AGC AAA 
48 

45 Met Ala Ala Glu Ala Ala Gly Gly Lys Tyr Arg Ser Thr Val Ser Lys 
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10 



15 



AGC AAA GAC CCC TCG GGG CTG CTC ATC TCG GTG ATC AGG ACT CTG TCT 
96 

5 Ser Lys Asp Pro Ser Gly Leu Leu lie Ser Val lie Arg Thr Leu Ser 
20 25 30 



10 



ACC AGT GAT GAT GTT GAA GAC CGA GAA AAT GAG AAA GGT CGC CTT GAA 
Thr Ser Asp Asp Val Glu Asp Arg Glu Asn Glu Lys Gly Arg Leu Glu 
35 40 45 



144 



15 



GAA GCC TAT GAG AAG TGT GAC CGT GAC CTG GAT GAA TTG ATC GTT CAG 192 
Glu Ala Tyr Glu Lys Cys Asp Arg Asp Leu Asp Glu Leu He Val Gin 
50 55 60 

CAC TAC ACA GAA TTG ACA ACA GCC ATT CGC ACA TAC CAG AGC ATC ACA 240 
His Tyr Thr Glu Leu Thr Thr Ala He Arg Thr Tyr Gin Ser He Thr 
65 70 75 80 



20 GAA CGC ATC ACT AAC TCC AGG AAT AAA ATC AAG CAG GTA AAA GAG AAC 
Glu Arg He Thr Asn Ser Arg Asn Lys He Lys Gin Val Lys Glu Asn 
85 90 95 



288 



CTG CTT TCC TGC AAG ATG CTG TTG CAT TGC AAA CGG GAT GAG CTT CGT 
25 Leu Leu Ser Cys Lys Met Leu Leu His Cys Lys Arg Asp Glu Leu Arg 
100 105 HO 



336 



30 



AAA CTA TGG ATT GAA GGA ATT GAG CAT AAG CAT GTC CTG AAC CTT CTG 
Lys Leu Trp He Glu Gly He Glu His Lys His Val Leu Asn Leu Leu 
115 120 125 



384 



35 



GAT GAG ATT GAA AAC ATC AAG CAA GTG CCT CAA AAG CTG GAA CAG TGC 432 
Asp Glu He Glu Asn He Lys Gin Val Pro Gin Lys Leu Glu Gin Cys 
130 135 140 

ATG GCC AGC AAG CAC TAC CTC AGC GCC ACG GAC ATG CTG GTG TCA GCA 480 
Met Ala Ser Lys His Tyr Leu Ser Ala Thr Asp Met Leu Val Ser Ala 
145 150 155 160 



40 GTG GAG TCT CTG GAG GGT CCG CTG CTC CAG GTG GAA GGA CTC AGT GAC 
Val Glu Ser Leu Glu Gly Pro Leu Leu Gin Val Glu Gly Leu Ser Asp 
165 170 175 



528 



CTC AGG CTG GAA CTT CAC AGC AAG AAG ATG AAC CTG CAC TTG GTT CTC 
45 Leu Arg Leu Glu Leu His Ser Lys Lys Met Asn Leu His Leu Val Leu 
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47 
185 



190 



ATA GAG GAA CTG CAC CGA CAT CTG TAC ATC AAA TCC ACT AGC CGG GTT 624 
5 He Glu Glu Leu His Arg His Leu Tyr He Lys Ser Thr Ser Arg Val 
195 200 205 

GTA CAG CGT AAC AAG GAA AAA GGG AAG ATG AGT TCT CAT GGC AAA GAT 672 
Val Gin Arg Asn Lys Glu Lys Gly Lys Met Ser Ser His Gly Lys Asp 
10 210 215 220 

GCT TCT CCT GGT CCT CTG ATT GAT GTT TCA AAC ATC TCT ACT CCA CGC 720 
Ala Ser Pro Gly Pro Leu He Asp Val Ser Asn He Ser Thr Pro Arg 
225 230 235 240 

15 

AAA TTC CTT GAT GCC ACT CAA TAT TCT GCT GCT GGA AGC TCA AGT GTG 768 
Lys Phe Leu Asp Ala Thr Gin Tyr Ser Ala Ala Gly Ser Ser Ser Val 
245 250 255 

20 AGG GAG ATG AAC CTG CAG GAC ATC AAA GAG GAC TTG GAC TGT GAT CCT 816 
Arg Glu Met Asn Leu Gin Asp He Lys Glu Asp Leu Asp Cys Asp Pro 
260 265 270 

GAG GAG AAC AGC ACT CTT TTC ATG GGA ATT CTT ATT CAG GGG CTG GCC 864 
25 Glu Glu Asn Ser Thr Leu Phe Met Gly He Leu He Gin Gly Leu Ala 
275 280 285 

AGA CTG AAG AAG ATC CCT GAG ACC GTT AAA GCC ATT AAA GAG CGT TTG 912 
Arg Leu Lys Lys He Pro Glu Thr Val Lys Ala lie Lys Glu Arg Leu 
30 290 295 300 

GAG CAG GAG CTA AAG CAG ATT GTG AAG AGG TCA ACC ACC CAG GTG GCA 960 
Glu Gin Glu Leu Lys Gin He Val Lys Arg Ser Thr Thr Gin Val Ala 
305 310 315 320 

35 

GAC AGT GCC TAT CAG AGG GGA GAG AGC CTC ACT GTG GAC AAC CAG CCA 1008 
Asp Ser Ala Tyr Gin Arg Gly Glu Ser Leu Thr Val Asp Asn Gin Pro 
325 330 335 

40 AGG TTA CTT CTA GAA CTG CTG GAA CTG TTG TTT GAT AAG TTT AAT GCT 1056 
Arg Leu Leu Leu Glu Leu Leu Glu Leu Leu Phe Asp Lys Phe Asn Ala 
340 345 350 



45 GTA GCC AGT GCA CAT TCT ATA GTC CTG GGG TAC CTT CAG GAC TCT GTG 
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Val Ala Ser Ala His Ser He Val Leu Gly Tyr Leu Gin Asp Ser Val 
355 360 365 

GGG ACC CAG CCA ACA CAG CAG GAA GAG ATC AAA TTA TAC GAT ATG GCA 1152 
5 Gly Thr Gin Pro Thr Gin Gin Glu Glu He Lys Leu Tyr Asp Met Ala 
370 375 380 

GAT GTG TGG GTG AAG ATC CAG GAT GTG CTG CAG ATG CTG TTG ACT GAA 1200 
Asp Val Trp Val Lys He Gin Asp Val Leu Gin Met Leu Leu Thr Glu 
10 385 390 395 400 

TAC TTG GAT ATG AAG AAC ACA CGT ACT GCG TCA GAG CCG TCA GCT CAA 1248 
Tyr Leu Asp Met Lys Asn Thr Arg Thr Ala Ser Glu Pro Ser Ala Gin 
405 410 415 

15 

CTA AGC TAT GCC AGT ACT GGA CGA GAG TTC GCA GCC TTT TTT GCC AAG 1296 
Leu Ser Tyr Ala Ser Thr Gly Arg Glu Phe Ala Ala Phe Phe Ala Lys 
420 425 430 

20 AAG AAA CCA CAA AGG CCA AAG AAT TCT CTT TTC AAG TTT GAA TCA TCC 1344 
Lys Lys Pro Gin Arg Pro Lys Asn Ser Leu Phe Lys Phe Glu Ser Ser 
435 440 445 

TCC CAT GCT ATC AGT ATG AGC GCC TAT CTC CGA GAA CAG AGA AGG GAG 1392 
25 Ser His Ala He Ser Met Ser Ala Tyr Leu Arg Glu Gin Arg Arg Glu 
450 455 460 

CTG TAC AGT CGG AGT GGA GAA CTT CAA GGA GGT CCT GAT GAC AAC TTA 1440 
Leu Tyr Ser Arg Ser Gly Glu Leu Gin Gly Gly Pro Asp Asp Asn Leu 
30 465 470 475 480 

ATT GAA GGT GGA GGA ACA AAA TTT GTC TGC AAA CCT GGA GCC AGA AAT 1488 
He Glu Gly Gly Gly Thr Lys Phe Val Cys Lys Pro Gly Ala Arg Asn 
485 490 495 



35 



40 



ATT ACC GTC ATA TTC CAT CCA CTG CTG AGA TTT ATT CAG GAG ATT GAG 1536 
He Thr Val He Phe His Pro Leu Leu Arg Phe He Gin Glu He Glu 
500 505 510 



CAT GCC CTG GGA CTT GGC CCT GCC AAA CAG TGT CTC CTT CGA GAG TTT 1584 
His Ala Leu Gly Leu Gly Pro Ala Lys Gin Cys Leu Leu Arg Glu Phe 
515 520 525 



45 



CTC ACT ATA TAT ATC AAG AAC ATC TTC CTT AAT CAG GTC TTG ACT GAG 
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Leu Thr lie Tyr He Lys Asn He Phe Leu Asn Gin Val Leu Thr Glu 
530 535 540 

ATC AAC AAG GAG ATC GAA GGT GTC ACC AAA ACC TCA GAC CCC TTG AAG 1680 
5 He Asn Lys Glu He Glu Gly Val Thr Lys Thr Ser Asp Pro Leu Lys 
545 550 555 560 

ATC CTA GCT AAT GCA GAC ACC ATG AAG GTT CTG GGC GTG CAG AGG CCT 1728 
He Leu Ala Asn Ala Asp Thr Met Lys Val Leu Gly Val Gin Arg Pro 
10 565 570 575 

CTT CTA CAG AGC ACC ATC ATC GTG GAG AAG ACA GTA CAA GAC CTC ATG 1776 
Leu Leu Gin Ser Thr He He Val Glu Lys Thr Val Gin Asp Leu Met 
580 585 590 

15 

AAC CTG ATG CAT GAC TTG AGT GCA TAT TCA GAT CAG TTC CTC AAC ATG 1824 
Asn Leu Met His Asp Leu Ser Ala Tyr ser Asp Gin Phe Leu Asn Met 
595 600 605 

20 GTG TGT GTG AAG CTA CAG GAA TAC AAG GAC ACA TGC TCC ACA GCC TAC 1872 
Val Cys Val Lys Leu Gin Glu Tyr Lys Asp Thr Cys Ser Thr Ala Tyr 
610 615 620 

AGA GGC ATT GTC CAA TCA GAA GAG AAA CTT GTC ATC AGT GCA TCT TGG 1920 
25 Arg Gly He Val Gin Ser Glu Glu Lys Leu Val He Ser Ala Ser Trp 
625 630 635 640 

GCA AAA GAT GAT GAT ATC AGC AGA CTC TTG AAA TCA CTG CCG AAC TGG 1968 
Ala Lys Asp Asp Asp He Ser Arg Leu Leu Lys Ser Leu Pro Asn Trp 
30 645 650 655 

ACT AAC ATG GCT CAG CCC AAA CAG CTA AGG CCT AAG AGA GAG GAG GAG 2016 
Thr Asn Met Ala Gin Pro Lys Gin Leu Arg Pro Lys Arg Glu Glu Glu 
660 665 670 

35 

GAA GAC TTC ATA AGG GCA GCC TTT GGC AAG GAG TCT GAG GTT CTG ATT 2064 
Glu Asp Phe He Arg Ala Ala Phe Gly Lys Glu Ser Glu Val Leu He 
675 680 685 

40 

GGA AAC CTT GGT GAC AAA CTC ATT CCT CCA CAA GAT ATT CTC CGT GAC 2112 
Gly Asn Leu Gly Asp Lys Leu He Pro Pro Gin Asp He Leu Arg Asp 
690 695 700 

45 GTC AGT GAC CTC AAA GCC TTG GCC AAC ATG CAT GAA AGC CTG GAA TGG 2160 
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Val Ser Asp Leu Lys Ala Leu Ala Asn Met His Glu Ser Leu Glu Trp 
705 710 715 720 

CTG GCA GGC CGA ACA AAG TCA GCT TTC TCC AGC CTT TCT GCA TCC CAG 2208 
5 Leu Ala Gly Arg Thr Lys Ser Ala Phe Ser Ser Leu Ser Ala Ser Gin 
725 730 735 

ATG CTT TCC CCT GCT CAA GAG AGC CAC GTA AAC ATG GAC CTT CCT CCA 2256 
Met Leu Ser Pro Ala Gin Glu Ser His Val Asn Met Asp Leu Pro Pro 
10 740 745 750 

GTG TCT GAA CAG ATC ATG CAG ACG CTG. AGT GAA CTT GCC AAG TCC TTC 2304 
Val Ser Glu Gin He Met Gin Thr Leu Ser Glu Leu Ala Lys Ser Phe 
755 760 765 

15 

CAG GAC ATG GCT GAC CGC TGC TTG CTC GTC TTG CAT CTG GAA GTG AGA 2352 
Gin Asp Met Ala Asp Arg Cys Leu Leu Val i.eu His Leu Glu Val Arg 
770 775 780 

20 GTT CAT TGT TTC CAC TAT CTC ATC CCT CTG GCA AAG GAG GGG AAC TAT 24 00 

Val His Cys Phe His Tyr Leu He Pro Leu Ala Lys Glu Gly Asn Tyr 
785 790 795 800 

GCC ATT GTG GCC AAT GTG GAA AGC ATG GAT TAT GAC CCT TTA GTG GTC 2448 
25 Ala He Val Ala Asn Val Glu Ser Met Asp Tyr Asp Pro Leu Val Val 
805 810 815 

AAG CTC AAC AAA GAC ATC AGT GCC ATG GAA GAG GCC ATG AGC GCC AGC 2496 
Lys Leu Asn Lys Asp He Ser Ala Met Glu Glu Ala Met Ser Ala Ser 
30 820 825 830 



CTC CAG CAA CAC AAG TTC CAG TAC ATC TTC GAA GGT CTG GGA CAC CTC 2544 
Leu Gin Gin His Lys Phe Gin Tyr He Phe Glu Gly Leu Gly His Leu 
35 835 840 845 

ATC TCC TGT ATC CTC ATT AAT GGG GCT CAG TAC TTC CGG CGC ATC AGT 2592 
He Ser Cys He Leu He Asn Gly Ala Gin Tyr Phe Arg Arg He Ser 
850 855 860 

40 

GAG TCT GGC ATC AAG AAA ATG TGT AGA AAC ATT TTT GTT CTT CAG CAG 2640 
Glu Ser Gly He Lys Lys Met Cys Arg Asn He Phe Val Leu Gin Gin 
865 870 875 880 

45 AAT TTG ACC AAC ATC ACC ATG TCA CGG GAG GCA GAC TTG GAC TTT GCA 2688 
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Asn Leu Thr. Asn lie Thr Met Ser Arg Glu Ala Asp Leu Asp Phe Ala 
885 890 895 

AGA CAG TAG TAC GAG ATG CTG TAC AAC ACT GCT GAT GAG CTC CTG AAC 2736 
5 Arg Gin Tyr Tyr Glu Met Leu Tyr Asn Thr Ala Asp Glu Leu Leu Asn 
900 905 910 

CTG GTG GTG GAC CAG GGT GTG AAA TAC ACA GAG CTA GAG TAC ATC CAT 2784 
Leu Val Val Asp Gin Gly Val Lys Tyr Thr Glu Leu Glu Tyr lie His 
10 915 920 925 

GCC CTG ACC CTG CTG CAC CGA AGC CAG ACT GGC GTG GGG GAC CAG ACC 2832 
Ala Leu Thr Leu Leu His Arg Ser Gin Thr Gly Val Gly Asp Gin Thr 
930 935 940 

15 

ACC CAG AAC ACC AGG CTG CAG AGA CTC AAG GAG ATC ATC TGT GAG CAG 2880 
Thr Gin Asn Thr Arg Leu Gin Arg Leu Lys Glu He He Cys Glu Gin 
945 950 955 960 

20 GCT GCC ATC AAG CAA GCC ACC AAG GAC AAG AAA ATA ACC ACT GTG 2925 
Ala Ala He Lys Gin Ala Thr Lys Asp Lys Lys He Thr Thr Val 
965 970 975 



25 (2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 975 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

35 

Met Ala Ala Glu Ala Ala Gly Gly Lys Tyr Arg Ser Thr Val Ser Lys 
15 10 15 

Ser Lys Asp Pro Ser Gly Leu Leu He Ser Val He Arg Thr Leu Ser 
40 20 25 30 

Thr Ser Asp Asp Val Glu Asp Arg Glu Asn Glu Lys Gly Arg Leu Glu 
35 40 45 



45 Glu Ala Tyr Glu Lys Cys Asp Arg Asp Leu Asp Glu Leu He Val Gin 
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50 



55 



60 



His Tyr Thr Glu Leu Thr Thr Ala lie Arg Thr Tyr Gin Ser He Thr 
65 70 75 80 

5 

Glu Arg He Thr Asn Ser Arg Asn Lys He Lys Gin Val Lys Glu Asn 
B5 90 95 

Leu Leu Ser Cys Lys Met Leu Leu His Cys Lys Arg Asp Glu Leu Arg 
10 100 105 110 

Lys Leu Trp He Glu Gly He Glu His Lys His Val Leu Asn Leu Leu 
115 120 125 

15 Asp Glu He Glu Asn He Lys Gin Val Pro Gin Lys Leu Glu Gin Cys 
130 135 140 



20 



Met Ala Ser Lys His Tyr Leu Ser Ala Thr Asp Met Leu Val Ser Ala 
145 150 155 160 

Val Glu Ser Leu Glu Gly Pro Leu Leu Gin Val Glu Gly Leu Ser Asp 
165 170 175 



25 Leu Arg Leu Glu Leu His Ser Lys Lys Met Asn Leu His Leu Val Leu 
180 185 190 

He Glu Glu Leu His Arg His Leu Tyr He Lys Ser Thr Ser Arg Val 
195 200 205 

30 

Val Gin Arg Asn Lys Glu Lys Gly Lys Met Ser Ser His Gly Lys Asp 
210 215 220 

Ala Ser Pro Gly Pro Leu He Asp Val Ser Asn He Ser Thr Pro Arg 
35 225 230 235 240 

Lys Phe Leu Asp Ala Thr Gin Tyr Ser Ala Ala Gly Ser Ser Ser Val 
245 250 255 

40 Arg Glu Met Asn Leu Gin Asp He Lys Glu Asp Leu Asp Cys Asp Pro 
260 265 270 



Glu Glu Asn Ser Thr Leu Phe Met Gly He Leu He Gin Gly Leu Ala 
275 280 285 



45 
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Arg Leu Lys Lys lie Pro Glu Thr Val Lys Ala He Lys Glu Arg Leu 
290 295 300 

Glu Gin Glu Leu Lys Gin He Val Lys Arg Ser Thr Thr Gin Val Ala 
5 305 310 315 320 

Asp Ser Ala Tyr Gin Arg Gly Glu Ser Leu Thr Val Asp Asn Gin Pro 
325 330 335 

10 Arg Leu Leu Leu Glu Leu Leu Glu Leu Leu Phe Asp Lys Phe Asn Ala 
340 345 350 

Val Ala Ser Ala His Ser He Val Leu Gly Tyr Leu Gin Asp Ser Val 
355 360 365 

15 

Gly Thr Gin Pro Thr Gin Gin Glu Glu He Lys Leu Tyr Asp Met Ala 
370 375 -*80 

Asp Val Trp Val Lys lie Gin Asp Val Leu Gin Met Leu Leu Thr Glu 
20 385 390 395 400 

Tyr Leu Asp Met Lys Asn Thr Arg Thr Ala Ser Glu Pro Ser Ala Gin 
405 410 415 

25 Leu Ser Tyr Ala Ser Thr Gly Arg Glu Phe Ala Ala Phe Phe Ala Lys 
420 425 430 



30 



Lys Lys Pro Gin Arg Pro Lys Asn Ser Leu Phe Lys Phe Glu Ser Ser 
435 440 445 

Ser His Ala He Ser Met Ser Ala Tyr Leu Arg Glu Gin Arg Arg Glu 
450 455 460 



Leu Tyr Ser Arg Ser Gly Glu Leu 
35 465 470 

He Glu Gly Gly Gly Thr Lys Phe 
485 

40 He Thr Val He Phe His Pro Leu 
500 

His Ala Leu Gly Leu Gly Pro Ala 
515 520 

45 



Gin Gly Gly Pro Asp Asp Asn Leu 
475 ; 480 

Val Cys Lys Pro Gly Ala Arg Asn 
490 495 

Leu Arg Phe He Gin Glu He Glu 
505 510 

Lys Gin Cys Leu Leu Arg Glu Phe 
525 
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Leu Thr lie Tyr He Lys Asn He Phe Leu Asn Gin Val Leu Thr Glu 
530 535 540 

He Asn Lys Glu He Glu Gly Val Thr Lys Thr Ser Asp Pro Leu Lys 
5 545 550 555 560 

He Leu Ala Asn Ala Asp Thr Met Lys Val Leu Gly Val Gin Arg Pro 
565 570 575 

10 Leu Leu Gin Ser Thr He He Val Glu Lys Thr Val Gin Asp Leu Met 
580 585 590 

Asn Leu Met His Asp Leu Ser Ala Tyr Ser Asp Gin Phe Leu Asn Met 
595 600 605 

15 

Val Cys Val Lys Leu Gin Glu Tyr Lys Asp Thr Cys Ser Thr Ala Tyr 
610 615 620 

Arg Gly He Val Gin Ser Glu Glu Lys Leu Val He Ser Ala Ser Trp 
20 625 630 635 640 

Ala Lys Asp Asp Asp He Ser Arg Leu Leu Lys Scr Leu Pro Asn Trp 
645 650 655 

25 Thr Asn Met Ala Gin Pro Lys Gin Leu Arg Pro Lys Arg Glu Glu Glu 
660 665 670 

Glu Asp Phe He Arg Ala Ala Phe Gly Lys Glu Ser Glu Val Leu He 
675 680 685 

30 

Gly Asn Leu Gly Asp Lys Leu He Pro Pro Gin Asp He Leu Arg Asp 
690 695 700 

Val Ser Asp Leu Lys Ala Leu Ala Asn Met His Glu Ser Leu Glu Trp 
35 705 710 715 720 

Leu Ala Gly Arg Thr Lys Ser Ala Phe Ser Ser Leu Ser Ala Ser Gin 
725 730 735 



40 Met Leu Ser Pro Ala Gin Glu Ser His Val Asn Met Asp Leu Pro Pro 
740 745 750 



45 



Val Ser Glu Gin He Met Gin Thr Leu Ser Glu Leu Ala Lys Ser Phe 
755 760 765 
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Gin Asp Met Ala Asp Arg Cys Leu Leu Val Leu His Leu Glu Val Arg 
770 775 780 

Val His Cys Phe His Tyr Leu He Pro Leu Ala Lys Glu. Gly Asn Tyr 
5 785 790 795 800 

Ala He Val Ala Asn Val Glu Ser Met Asp Tyr Asp Pro Leu Val Val 
805 810 815 

10 Lys Leu Asn Lys Asp He Ser Ala Met Glu Glu Ala Met Ser Ala Ser 
820 825 830 

Leu Gin Gin His Lys Phe Gin Tyr He Phe Glu Gly Leu Gly His Leu 
835 840 845 



15 



He Ser Cys He Leu He Asn Gly Ala Gin Tyr Phe Arg Arg He Ser 
850 855 860 



20 865 



Glu Ser Gly He Lys Lys Met Cys Arg Asn He Phe Val Leu Gin Gin 
870 875 880 



Asn Leu Thr Asn He Thr Met Ser Arg Glu Ala Asp Leu Asp Phe Ala 
885 890 895 



25 Arg Gin Tyr Tyr Glu Met Leu Tyr Asn Thr Ala Asp Glu Leu Leu Asn 
900 905 910 

Leu Val Val Asp Gin Gly Val Lys Tyr Thr Glu Leu Glu Tyr He His 
915 920 925 



30 



Ala Leu Thr Leu Leu His Arg Ser Gin Thr Gly Val Gly Asp Gin Thr 
930 935 940 



35 945 



Thr Gin Asn Thr Arg Leu Gin Arg Leu Lys Glu He He Cys Glu Gin 
950 955 960 



Ala Ala He Lys Gin Ala Thr Lys Asp Lys Lys He Thr Thr Val 
965 970 975 



40 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
45 (B) TYPE: amino acid 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

5 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: rsec6.3 



(xi) SEQUENCE DESCRIPTION: SEQ ID.NO:7: 

15 

Ala Ala He Gin Ser Gin Leu Asp Gly Val Arg Thr Gly Leu Ser Gin 
15 10 15 



20 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



30 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: rsec8.1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Xaa Ala Pro Glu Gly Pro Leu He Asp Val Xaa Asn He 
40 1 5 10 



45 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

<vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: rsec8.2 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Glu Phe Ala Ala Phe Phe Ala Lys 
1 5 

20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: rsec8.3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

40 Xaa Leu Gly Val Gin Arg Pro Leu Leu Gin Ser Thr Ser He Xaa Glu 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 11: 

45 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p71.3 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Leu His Leu lie Ala Gin Glu Leu Pro Phe Asp Arg Phe Ser Glu Val 
15 10 15 



Lys 



(2) INFORMATION FOR SEQ ID NO: 14: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p79.1 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Xaa Thr Asp Tyr He Ala Glu 
1 5 

45 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
5 (B) TYPE : amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: p79.2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

20 Glu Thr Tyr Gly Ala Phe Leu Ser Arg Ser Xaa Gly 
15 10 



(2) INFORMATION FOR SEQ ID NO: 16: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p79.3 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Xaa Xaa Pro Pro Gin Gly Val Pro Tyr Asn Pro Ala Ser Pro 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: p79.4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

20 Xaa Xaa Gin Glu Glu Glu Thr Leu Met Phe He Arg Gly Asn 
15 10 

(2) INFORMATION FOR SEQ ID NO: 18: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p79.5 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ala Leu Phe He Arg Asp Asp Xaa Gin Phe 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: p79.6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

20 Asn Leu Pro Val Phe Gin Ser Cys Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
35 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p79.7 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Ala Val Glu Tyr Phe Gin Asp Lys Phe Pro Asp 
15 10 

45 



WO 98/28419 



PCT/US97/23498 



64 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: p79.8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2l: 

20 Tyr Arg Val Glu Gin Val Gly Asp Met He Asp Arg Leu Phe Asp Thr 
15 10 15 

Ser 

(2) INFORMATION FOR SEQ ID NO: 22: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p79.9 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Val Tyr Glu Asp Pro Ala Leu Ser Ala He Phe Leu His Asn Asn Tyr 
15 10 15 

45 
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Asn Tyr 

(2) INFORMATION FOR SEQ ID NO: 23: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 
15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p79.10 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Xaa Xaa Tyr Gly Ala Phe Leu His Arg Tyr Ser Ser Val Pro Phe Val 
1 5 10 15 

25 

Tyr Gly Xaa His 
20 



30 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

40 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84 . 1 



45 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gin Leu Ser Gin Gin Ser Asp Gly Asp Arg Asp Leu Gin Glu Trp Arg 
15 10 15 

5 

Gin Arg Val Gin Ala Leu Ala Glu Glu Thr Ala Gin Tyr Lys 
20 25 30 



10 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



20 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84.2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



Xaa Leu Gin Leu Ser Phe Asn Phe Ser Glu Pro Asn Arg Gin Arg Pro 
30 1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 26: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

45 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84.3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
Ser lie Pro Leu Ala Leu Leu Pro Ala Ala Ala Ala Gly Ala 
15 10 



10 (2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



20 



25 



(vi) ORIGINAL SOURCE: 

<C) INDIVIDUAL ISOLATE: p84.4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Asp Ala Val Xaa Gin Asn Ser Thr Gin Ala Ala Glu Thr Glu Asn 
30 1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 28: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



45 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84 . 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



Asp Tyr Arg Asn Asp Glu Ala 
1 5 



10 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



25 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84 . 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

30 

Glu Asn Asn Pro Glu Glu Asp Asp Pro Ser 
15 10 



35 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

45 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84.7 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Xaa Leu Ser Gin Gin Ser Asp Xaa Gly 
10 1 5 

(2) INFORMATION FOR SEQ ID NO: 31: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

25 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84 . 8 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Ala Ala Ala Leu Arg Ala Pro Pro Xaa Val Thr Ser 
15 10 

35 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84 . 9 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Xaa Lys Arg Glu Pro Leu Glu 
10 1 5 



(2) INFORMATION FOR SEQ ID NO: 33: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p96 . 1 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Val Val Gly Gin Phe Pro Phe Gin Asp Thr Glu Leu Glu Lys 
15 10 

35 (2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

45 
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(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p96.2 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Xaa Val Tyr Glu He Phe Asp Asn 
10 1 5 

(2) INFORMATION FOR SEQ ID NO: 35: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

25 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p96.3 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

Asp Phe Leu Glu Ser He Arg 
1 5 

35 (2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

45 



5 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p96.4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Xaa Asp Gin Asp Leu Gin Leu Ala Asp Tyr Asp His Met Thr 
10 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 37: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p96.5 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



Ser Thr Asn Leu Leu Leu Thr Arg Thr Leu Xaa Asn 
15 10 

35 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

45 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p!02.1 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

10 

Tyr Leu Ser Gly Leu Gin Ala Pro Gly Xaa Pro Ala Ser Gin Ser He 
15 10 15 

Gly Ala Gin 

15 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

25 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

30 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl02.2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

35 

Gly Gly Leu Ser Thr Phe Phe Glu Ala Gin Asp Ala Leu Ser Ala He 
15 10 15 

His Gin Lys 



(2) INFORMATION FOR SEQ ID NO: 40: 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl02.3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

15 

Ala Ser Asn Thr Ala Asp Thr Xaa Arg Gin Glu Arg 
15 10 



20 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



30 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl02.4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Xaa Arg Glu Asn Tyr lie Glu Gly Gly lie 
40 1 5 10 



45 



(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl02.5 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Glu Asn Leu Gly Arg Leu Phe Glu Asn Tyr He 
15 10 



20 (2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



30 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl02.6 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

Xaa Asp Tyr Asp Val Val He Asn Asp Tyr Glu 
40 1 5 10 



(2) INFORMATION FOR SEQ ID NO: 44: 



45 



(i) SEQUENCE CHARACTERISTICS: 



WO 98/28419 



PCT/US97/23498 



76 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<iv) ANTI- SENSE: NO 

10 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl02 . 7 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Xaa He Pro Xaa Leu Ser Thr Arg Pro Ala Asn Pro 
15 10 

20 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: pl06.1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

40 Xaa Xaa Tyr Gly Glu He Ala Xaa Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 46: 

45 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

10 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl06.2 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Xaa Ala Thr Val Ser Leu Pro Glu Lys 
1 5 

20 . 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pi 06. 3 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

40 

Xaa Asp Tyr Gly Val lie Ala Asn Asp 
1 5 



45 (2) INFORMATION FOR SEQ ID NO:48: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl06.4 



15 



20 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Leu He Lys Tyr Phe Phe Met Val Ala Ser Val Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



35 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl06.5 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Glu Leu Pro Glu Phe Asn Leu His Phe Phe Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 50: 

45 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

10 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

<C) INDIVIDUAL ISOLATE: pl06.6 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Xaa Leu Gin Asp Val Asp Leu Ala Ser Xaa Arg 
15 10 

20 

(2) INb ORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl06.7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
40 Xaa Asn Arg Xaa Asn Glu Pro Ala Val Asn Val Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 52: 

45 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: pl06.8 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Xaa Gin Leu Xaa Asn He Val Glu Pro Glu Xaa He Tyr 
15 10 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 2127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p71 (rseclO) 



40 (ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1. . .2124 



45 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 53: 
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ATG GCC ACG ACG GCC GAG CTC TTC GAG GAG CCT TTT GTG GCC GAT GAA 
Met Ala Thr Thr Ala Glu Leu Phe Glu Glu Pro Phe Val Ala Asp Glu 
1 5 10 15 



48 



TAC ATT GAA CGT CTC GTG TGG AGA ACC CCG GGA GGT GGT TCT AGG GGT 
Tyr He Glu Arg Leu Val Trp Arg Thr Pro Gly Gly Gly Ser Arg Gly 
20 25 30 



96 



GGA CCT GAA GCT TTT GAT CCT AAA AGG TTA TTA GAA GAA TTT GTG AAT 
10 Gly Pro Glu Ala Phe Asp Pro Lys Arg Leu Leu Glu Glu Phe Val Asn 
35 40 45 



144 



15 



CAT ATT CAA GAA CTA CAG ATA ATG GAT GAA AGG ATT CAA AGG AAA GTA 
His He Gin Glu Leu Gin He Met Asp Glu Arg He Gin Arg Lys Val 
50 55 60 



192 



20 



GAG AAG TTA GAA CAG CAG TGT CAG AAG GAG GCA AAG GAG TTT GCT AAG 240 
Glu Lys Leu Glu Gin Gin Cys Gin Lys Glu Ala Lys Glu Phe Ala Lys 
65 70 75 80 

AAG GTA CAA GAG CTG CAG AAA AGC AAC CAG GTT GCT TTC CAA CAT TTC 288 
Lys Val Gin Glu Leu Gin Lys Ser Asn Gin Val Ala Phe Gin His Phe 
85 90 95 



25 CAA GAA CTA GAT GAA CAC ATT AGC TAT GTA GCT ACC AAA GTG TGT CAC 
Gin Glu Leu Asp Glu His He Ser Tyr Val Ala Thr Lys Val Cys His 
100 105 HO 



336 



CTT GGA GAC CAG TTG GAA GGG GTA AAC ACA CCT AGA CAG CGT GCA GTG 
30 Leu Gly Asp Gin Leu Glu Gly Val Asn Thr Pro Arg Gin Arg Ala Val 
115 120 125 



384 



35 



GAG GCG CAG AAA TTG ATG AAA TAC TTT AAT GAG TTT CTA GAT GGA GAA 
Glu Ala Gin Lys Leu Met Lys Tyr Phe Asn Glu Phe Leu Asp Gly Glu 
130 135 140 



432 



40 



TTG AAA TCT GAT GTT TTT ACA AAT CCT GAA AAG ATC AAG GAA GCA GCA 480 
Leu Lys Ser Asp Val Phe Thr Asn Pro Glu Lys He Lys Glu Ala Ala 
145 150 155 160 

GAC GTC ATT CAG AAA CTG CAT CTG ATC GCC CAG GAG TTA CCT TTC GAT 528 
Asp Val He Gin Lys Leu His Leu He Ala Gin Glu Leu Pro Phe Asp 
165 170 175 



45 



AGG TTT TCA GAA GTA AAA TCC AAA ATT GCA AGT AAA TAC CAC GAC TTA 



576 
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Arg Phe Ser Glu Val Lys Ser Lys lie Ala Ser Lys Tyr His Asp Leu 
180 185 190 



GAA TGC CAG CTG ATT CAG GAG TTT ACC AGT GCA CAA CGA AGA GGT GAG 
Glu Cys Gin Leu lie Gin Glu Phe Thr Ser Ala Gin Arg Arg Gly Glu 
195 200 205 



624 



10 



GTC TCC CGG ATG AGG GAG GTG GCA GCT GTC CTG CTT CAT TTT AAG GGT 
Val Ser Arg Met Arg Glu Val Ala Ala Val Leu Leu His Phe Lys Gly 
210 215 220 



672 



15 



TAT TCC CAC TGT ATT GAT GTT TAT ATA AAG CAG TGC CAG GAG GGT GCT 720 
Tyr Ser His Cys He Asp Val Tyr He Lys Gin Cys Gin Glu Gly Ala 
225 230 235 240 

TAC TTG AGA AAT GAT ATA TTC GAA GAC GCA GCA ATT CTC TGT CAG CGA 768 
Tyr Leu Arg Asn Asp He Phe Glu Asp Ala Ala He Leu Cys Gin Arg 
245 250 255 



20 GTG AAC AAG CAA GTT GGA GAT ATC TTT AGT AAT CCA GAA GCA GTA CTG 
Val Asn Lys Gin Val Gly Asp He Phe Ser Asn Pro Glu Ala Val Leu 
260 265 270 



816 



25 GCT AAG CTT ATT CAG AAT GTG TTT GAA GTC AAA CTA CAG AGT TTT GTG 864 
Ala Lys Leu He Gin Asn Val Phe Glu Val Lys Leu Gin Ser Phe Val 
275 280 285 

AAA GAT CAG TTA GAA GAA TGT AGG AAA TCG GAT GCT GAG CAG TAT CTT 912 
30 Lys Asp Gin Leu Glu Glu Cys Arg Lys Ser Asp Ala Glu Gin Tyr Leu 
290 295 300 

AAA AGT CTG TAT GAT CTG TAC ACA AGA ACC ACC AGT CTT TCC AGC AAA 960 
Lys Ser Leu Tyr Asp Leu Tyr Thr Arg Thr Thr Ser Leu Ser Ser Lys 
35 305 310 315 320 

TTG ATG GAG TTT AAC TTA GGT ACT GAT AAA CAG ACT TTC TTG TCT AAG 1008 
Leu Met Glu Phe Asn Leu Gly Thr Asp Lys Gin Thr Phe Leu Ser Lys 
325 330 335 



40 



CTT ATC AAA TCC ATT TTC GTT TCC TAT CTG GAG AAC TAT ATT GAA GTG 1056 
Leu lie Lys Ser He Phe Val Ser Tyr Leu Glu Asn Tyr He Glu Val 
340 345 350 



45 



GAG ATT GGA TAT TTG AAA AGC AGA AGT GCT ATG ATC CTA CAG CGC TAT 



1104 
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Glu He Gly Tyr Leu Lys Ser Arg Ser Ala Met He Leu Gin Arg Tyr 
355 360 365 

TAT GAT TCA AAA AAC CAC CAA AAG AGA TCC ATT GGC ACA GGA GGT ATT 1152 
5 Tyr Asp Ser Lys Asn His Gin Lys Arg Ser He Gly Thr Gly Gly He 
370 375 380 

CAA GAT TTG AAA GAA AGG ATT AGA CAA CGT ACC AAC TTA CCA CTG GGG 1200 
Gin Asp Leu Lys Glu Arg He Arg Gin Arg Thr Asn Leu Pro Leu Gly 
10 385 390 395 400 

CCA AGT ATT GAC ACA CAT GGG GAG ACT TTC CTA TCT CAA GAA GTG GTA 1248 
Pro Ser He Asp Thr His Gly Glu Thr Phe Leu Ser Gin Glu Val Val 
405 410 415 

15 

GTT AAT CTT TTA CAA GAA ACC AAA CAA GCC TTT GAA AGA TGT CAT AGG 1296 
Val Asn Leu Leu Gin Glu Thr Lys Gin Ala Phe Glu Arg Cys His Arg 
420 425 430 

20 

CTT TCT GAT CCT TCT GAT TTA CCA AGG AAT GCC TTT AGA ATT TTT ACC 1344 
Leu Ser Asp Pro Ser Asp Leu Pro Arg Asn Ala Phe Arg He Phe Thr 
435 440 445 

25 ATT CTT GTG GAA TTT TTA TGT ATT GAG CAT ATT GAT TAT GCT TTA GAA 1392 
He Leu Val Glu Phe Leu Cys He Glu His He Asp Tyr Ala Leu Glu 
450 455 460 

ACG GGG CTT GCT GGA ATT CCA TCA TCA GAT TCT AGG AAC GCA AAT CTC 1440 
30 Thr Gly Leu Ala Gly He Pro Ser Ser Asp Ser Arg Asn Ala Asn Leu 
465 470 475 480 

TAT TTT TTG GAT GTT GTA CAA CAG GCC AAT ACT ATT TTT CAT CTT TTT 1488 
Tyr Phe Leu Asp Val Val Gin Gin Ala Asn Thr He Phe His Leu Phe 
35 485 490 495 

GAC AAA CAG TTT AAT GAT CAC CTA ATG CCA CTA ATA AGC TCT TCT CCT 1536 
Asp Lys Gin Phe Asn Asp His Leu Met Pro Leu He Ser Ser Ser Pro 
500 505 510 

40 

AAG TTG TCT GAA TGC CTC CAG AAG AAG AAG GAG ATA ATT GAG CAG ATG 1584 
Lys Leu Ser Glu Cys Leu Gin Lys Lys Lys Glu He He Glu Gin Met 
515 520 525 

45 GAG ATG AAG TTG GAC ACT GGC ATC GAT AGG ACG TTA AAT TGT ATG ATT 1632 
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Glu Met Lys Leu Asp Thr Gly He Asp Arg Thr Leu Asn Cys Met He 

530 535 540 

GGA CAG ATG AAG CAC ATC TTG GCT GCA GAA CAA AAG AAA ACA GAT TTT 1680 
5 Gly Gin Met Lys His He Leu Ala Ala Glu Gin Lys Lys Thr Asp Phe 
545 550 555 560 

AAG CCA GAA GAT GAA AAC AAT GTT TTG ATT CAG TAT ACT AAT GCA TGT 1728 
Lys Pro Glu Asp Glu Asn Asn Val Leu He Gin Tyr Thr Asn Ala Cys 
10 565 570 575 

GTA AAG GTT TGT GCT TAT GTA AGA AAA CAA GTT GAG AAG ATT AAG AAT 1776 
Val Lys Val Cys Ala Tyr Val Arg Lys Gin Val Glu Lys He Lys Asn 
580 585 590 

15 

TCC ATG GAT GGG AAG AAT GTG GAC ACG GTT CTG ATG GAG CTC GGA GTA 1824 
Ser Met Asp Gly Lys Asn Val Asp Thr Val Leu Met Glu Leu Gly Val 
595 600 605 

20 CGT TTC CAC CGG CTT ACT TAT GAG CAT CTT CAA CAA TAT TCC TAC AGT 1872 
Arg Phe His Arg Leu Thr Tyr Glu His Leu Gin Gin Tyr Ser Tyr Ser 
610 615 620 

TGT ATG GGC GGC ATG CTT GCA ATT TGC GAC GTT GCT GAA TAT AGA AAA 1920 
25 Cys Met Gly Gly Met Leu Ala He Cys Asp Val Ala Glu Tyr Arg Lys 
625 630 635 640 

TGT GCC AAA GAC TTC AAG ATT CCA ATG GTA TTA CAT CTT TTT GAT ACT 1968 

30 Cys Ala Lys Asp Phe Lys He Pro Met Val Leu His Leu Phe Asp Thr 

645 650 655 

CTG CAT GCA CTT TGC AAT CTC CTA GTC GTT GCT CCA GAT AAT TTA AAG 2016 
Leu His Ala Leu Cys Asn Leu Leu Val Val Ala Pro Asp Asn Leu Lys 
35 660 665 670 

CAA GTC TGC TCA GGA GAA CAA CTG GCC AAT CTG GAC AAG AAC ATA CTC 2064 
Gin Val Cys Ser Gly Glu Gin Leu Ala Asn Leu Asp Lys Asn He Leu 
675 680 685 



40 



CAC TCC TTC GTC CAG CTT CGA GCT GAC TAT AGA TCC GCC CGC CTG GCT 2112 
His Ser Phe Val Gin Leu Arg Ala Asp Tyr Arg Ser Ala Arg Leu Ala 
690 695 700 



45 



CGG CAC TTC AGC TGA 



2127 
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Arg His Phe Ser 
705 



5 (2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

15 

Met Ala Thr Thr Ala Glu Leu Phe Glu Glu Pro Phe Val Ala Asp Glu 
15 10 15 

Tyr He Glu Arg Leu Val Trp Arg Thr Pro Gly Gly Gly Ser Arg Gly 
20 20 25 30 

Gly Pro Glu Ala Phe Asp Pro Lys Arg Leu Leu Glu Glu Phe Val Asn 

35 40 45 

His He Gin Glu Leu Gin He Met Asp Glu Arg He Gin Arg Lys Val 
50 55 60 

25 Glu Lys Leu Glu Gin Gin Cys Gin Lys Glu Ala Lys Glu Phe Ala Lys 
65 70 75 80 

Lys Val Gin Glu Leu Gin Lys Ser Asn Gin Val Ala Phe Gin His Phe 

85 90 95 

Gin Glu Leu Asp Glu His He Ser Tyr Val Ala Thr Lys Val Cys His 
30 100 105 110 

Leu Gly Asp Gin Leu Glu Gly Val Asn Thr Pro Arg Gin Arg Ala Val 

115 120 125 

Glu Ala Gin Lys Leu Met Lys Tyr Phe Asn Glu Phe Leu Asp Gly Glu 
130 135 140 

35 Leu Lys Ser Asp Val Phe Thr Asn Pro Glu Lys He Lys Glu Ala Ala 
145 150 155 160 

Asp Val He Gin Lys Leu His Leu He Ala Gin Glu Leu Pro Phe Asp 

165 170 175 

Arg Phe Ser Glu Val Lys Ser Lys He Ala Ser Lys Tyr His Asp Leu 
40 180 185 190 

Glu Cys Gin Leu He Gin Glu Phe Thr Ser Ala Gin Arg Arg Gly Glu 

195 200 205 

Val Ser Arg Met Arg Glu Val Ala Ala Val Leu Leu His Phe Lys Gly 
210 215 220 

45 Tyr Ser His Cys He Asp Val Tyr lie Lys Gin Cys Gin Glu Gly Ala 
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225 230 235 240 

Tyr Leu Arg Asn Asp He Phe Glu Asp Ala Ala He Leu Cys Gin Arg 

245 250 255 

Val Asn Lys Gin Val Gly Asp He Phe Ser Asn Pro Glu Ala Val Leu 
5 260 265 270 

Ala Lys Leu He Gin Asn Val Phe Glu Val Lys Leu Gin Ser Phe Val 

275 280 285 

Lys Asp Gin Leu Glu Glu Cys Arg Lys Ser Asp Ala Glu Gin Tyr Leu 
290 295 300 

10 Lys Ser Leu Tyr Asp Leu Tyr Thr Arg Thr Thr Ser Leu Ser Ser Lys 
305 310 315 320 

Leu Met Glu Phe Asn Leu Gly Thr Asp Lys Gin Thr Phe Leu Ser Lys 
325 330 335 



15 Leu He Lys Ser lie Phe Val Ser Tyr Leu Glu Asn Tyr He Glu Val 
340 345 350 

Glu He Gly Tyr Leu Lys Ser Arg Ser Ala Met He Leu Gin Arg Tyr 

355 360 365 

Tyr Asp Ser Lys Asn His Gin Lys Arg Ser He Gly Thr Gly Gly He 
20 370 375 380 

Gin Asp Leu Lys Glu Arg He Arg Gin Arg Thr Asn Leu Pro Leu Gly 
385 390 395 400 

Pro Ser He Asp Thr His Gly Glu Thr Phe Leu Ser Gin Glu Val Val 
405 410 415 

25 Val Asn Leu Leu Gin Glu Thr Lys Gin Ala Phe Glu Arg Cys His Arg 
420 425 430 

Leu Ser Asp Pro Ser Asp Leu Pro Arg Asn Ala Phe Arg He Phe Thr 

435 440 445 

He Leu Val Glu Phe Leu Cys He Glu His He Asp Tyr Ala Leu Glu 
30 450 455 460 

Thr Gly Leu Ala Gly He Pro Ser Ser Asp Ser Arg Asn Ala Asn Leu 
465 470 475 480 

Tyr Phe Leu Asp Val Val Gin Gin Ala Asn Thr He Phe His Leu Phe 
485 490 495 

35 Asp Lys Gin Phe Asn Asp His Leu Met Pro Leu He Ser Ser Ser Pro 
500 505 510 

Lys Leu Ser Glu Cys Leu Gin Lys Lys Lys Glu He He Glu Gin Met 

515 520 525 

Glu Met Lys Leu Asp Thr Gly He Asp Arg Thr Leu Asn Cys Met He 
40 530 535 540 

Gly Gin Met Lys His He Leu Ala Ala Glu Gin Lys Lys Thr Asp Phe 
545 550 555 560 

Lys Pro Glu Asp Glu Asn Asn Val Leu He Gin Tyr Thr Asn Ala Cys 
565 570 575 

45 Val Lys Val Cys Ala Tyr Val Arg Lys Gin Val Glu Lys He Lys Asn 
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10 



20 



30 



B7 

580 585 590 

Ser Met Asp Gly Lys Asn Val Asp Thr Val Leu Met Glu Leu Gly Val 

595 600 605 

Arg Phe His Arg Leu Thr Tyr Glu His Leu Gin Gin Tyr Ser Tyr Ser 

610 615 620 

Cys Met Gly Gly Met Leu Ala He Cys Asp Val Ala Glu Tyr Arg Lys 
625 630 635 640 

Cys Ala Lys Asp Phe Lys He Pro Met Val Leu His Leu Phe Asp Thr 
645 650 655 



Leu His Ala Leu Cys Asn Leu Leu Val Val Ala Pro Asp Asn Leu Lys 

660 665 670 

Gin Val Cys Ser Gly Glu Gin Leu Ala Asn Leu Asp Lys Asn He Leu 
675 680 685 

15 His Ser Phe Val Gin Leu Arg Ala Asp Tyr Arg Ser Ala Arg Leu Ala 
690 695 700 

Arg His Phe Ser 
705 



(2) INFORMATION FOR SEQ ID NO:55: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2328 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 



35 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p79 (rexo70) 

( ix) FEATURE : 

(A) NAME /KEY: Coding Sequence 
40 (B) LOCATION: 298... 2256 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
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TCAGGTGTAG CTGCATGCTC TGAACACAGA CACTCATGTT CTTCACCTGC TGAGGTAGCC 
60 

AAGACATTTC CATCAGGCTC TTGGTGTGTC AAGCTTTCTG CTGAGTCTGT GACATTTAGG 120 
TCCTGGACAT GAACTACCAT TGTGGGGACC TTTGGGAAGA GGCTCAAAGG GCTTATGTCT 180 
TAAATCAATA TTTGTTTCCT TTTTGTCTGG TCACATTTGG GCTGTTCGAG CTGCCATGTC 240 
TTGGAGCAGG GTTTGGAGCT AAGACATAGG GGAGTGGGAA CAGCGCTGTC CGCTGCG ATG 300 

Met 
1 



10 ATT CCC CCG CAG GAG GCT TCC GCT CGG CGG CGG GAG ATC GAG GAC AAG 
He Pro Pro Gin Glu Ala Ser Ala Arg Arg Arg Glu He Glu Asp Lys 
5 10 15 



348 



CTG AAG CAG GAG GAG GAG ACG CTG TCC TTT ATT CGA GAC AGC CTG GAG 
15 Leu Lys Gin Glu Glu Glu Thr Leu Ser Phe He Arg Asp Ser Leu Glu 
20 25 30 



396 



20 



AAG AGC GAC CAG CTC ACT AAG AAC ATG GTG TCT ATC CTG TCG TCC TTT 
Lys Ser Asp Gin Leu Thr Lys Asn Met Val Ser He Leu Ser Ser Phe 
35 40 45 



444 



25 



GAG AGC CGA CTC ATG AAG CTG GAG AAC TCC ATC ATC CCT GTG CAC AAG 492 
Glu Ser Arg Leu Met Lys Leu Glu Asn Ser He He Pro Val His Lys 
50 55 60 65 

CAG ACA GAG AAC CTA CAG AGG CTG CAG GAG AAT GTC GAA AAG ACC TTA 540 
Gin Thr Glu Asn Leu Gin Arg Leu Gin Glu Asn Val Glu Lys Thr Leu 
70 75 80 



30 TCC TGC CTG GAC CAT GTT ATC AGC TAT TAC CAC GTA GCC AGC GAC ACG 
Ser Cys Leu Asp His Val He Ser Tyr Tyr His Val Ala Ser Asp Thr 
85 90 95 



588 



GAG AAG ATC ATC AGA GAG GGC CCC ACA GGT AGG CTG GAA GAA TAC CTG 
35 Glu Lys He lie Arg Glu Gly Pro Thr Gly Arg Leu Glu Glu Tyr Leu 



100 



105 



110 



636 



40 



GGA AGC ATG GCC AAA ATC CAG AAG GCT GTG GAG TAC TTT CAG GAC AAC 
Gly Ser Met Ala Lys He Gin Lys Ala Val Glu Tyr Phe Gin Asp Asn 
115 120 125 



684 



AGC CCA GAT AGC CCA GAG CTC AAC AAA GTG AAG CTG CTG TTT GAG CGG 
Ser Pro Asp Ser Pro Glu Leu Asn Lys Val Lys Leu Leu Phe Glu Arg 
130 135 140 145 



732 



45 
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GGG AAG GAG TCC TTG GAA TCG GAG TTC CGC AGT CTG ATG ACC AGG CAC 780 
Gly Lys Glu Ser Leu Glu Ser Glu Phe Arg Ser Leu Met Thr Arg His 
150 155 160 

5 AGC AAG GTC ATC TCC CCT GTG CTC GTC CTG GAC CTC ATC AGT GCG GAT 828 
Ser Lys Val lie Ser Pro Val Leu Val Leu Asp Leu He Ser Ala Asp 
165 170 175 

GAC GAG CTG GAG GTC CAG GAG GAC GTG GTC CTG GAG CAC CTG CCT GAG 876 
10 Asp Glu Leu Glu Val Gin Glu Asp Val Val Leu Glu His Leu Pro Glu 
180 185 190 

AGC GTG CTC CAG GAT GTG ATC CGC ATC TCC CGC TGG CTG GTG GAA TAT 924 
Ser Val Leu Gin Asp Val He Arg He Ser Arg Trp Leu Val Glu Tyr 
15 195 200 205 

GGT CGG AAC CAA GAT TTC ATG AAT GTC TAC TAT CAG ATC CGC TCT AGC 972 
Gly Arg Asn Gin Asp Phe Met Asn Val Tyr Tyr Gin He Arg Ser Ser 
210 215 220 225 

20 

CAG CTG GAC CGC TCC ATC AAG GGT CTG AAG GAG CAT TTT CGC AAA AGC 1020 
Gin Leu Asp Arg Ser He Lys Gly L-ju Lys Glu His Phe Arg Lys Ser 
230 235 240 

25 AGT TCT TCC TCT GGG GTT CCC TAC TCC CCC GCC ATC CCC AAC AAG AGG 1068 
Ser Ser Ser Ser Gly Val Pro Tyr Ser Pro Ala He Pro Asn Lys Arg 
245 250 255 

AAA GAC ACG CCT ACC AAG AAG CCC ATC AAG CGG CCA GGG AGA GAT GAC 1116 
30 Lys Asp Thr Pro Thr Lys Lys Pro He Lys Arg Pro Gly Arg Asp Asp 
260 265 270 

ATG CTG GAC GTG GAG ACA GAT GCC TAC ATT CAC TGC GTT AGT GCC TTT 1164 
Met Leu Asp Val Glu Thr Asp Ala Tyr He His Cys Val Ser Ala Phe 
35 275 280 285 j 

GTC AGG CTG GCA CAG AGT GAA TAC CAG CTG CTG ATG GGC ATC ATT CCC 1212 
Val Arg Leu Ala Gin Ser Glu Tyr Gin Leu Leu Met Gly He He Pro 
290 295 300 305 



40 



GAG CAT CAC CAG AAG AAA ACC TTC GAC TCC TTG ATA CAG GAT GCC CTA 1260 
Glu His His Gin Lys Lys Thr Phe Asp Ser Leu lie Gin Asp Ala Leu 
310 315 320 



45 



GAT GGG CTG ATG CTT GAA GGG GAG AAC ATA GTG TCC GCG GCC AGG AAA 



1308 
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Asp Gly Leu Met Leu Glu Gly Glu Asn He Val Ser Ala Ala Arg Lys 
325 330 335 

5 GCC ATC ATC CGC CAT GAC TTC TCC ACC GTG CTC ACC GTC TTC CCC ATC 1356 
Ala He He Arg His Asp Phe Ser Thr Val Leu Thr Val Phe Pro He 
340 345 350 

CTG CGA CAC CTC AAG CAG ACC AAG CCT GAG TTT GAC CAG GTG CTC CAG 1404 
10 Leu Arg His Leu Lys Gin Thr Lys Pro Glu Phe Asp Gin Val Leu Gin 
355 360 365 

GGC ACA GCA GCC AGC ACG AAG AAC AAG CTG CCA GGC CTC ATC ACC TCC 1452 
Gly Thr Ala Ala Ser Thr Lys Asn Lys Leu Pro Gly Leu He Thr Ser 
15 370 375 380 385 

ATG GAG ACC ATT GGG GCC AAA GCT CTA GAA GAC TTT GCG GAT AAC ATC 1500 
Met Glu Thr He Gly Ala Lys Ala Leu Glu Asp Phe Ala Asp Asn He 
390 395 400 

20 

AAG AAT GAT CCA GAC AAG GAG TAC AAC ATG CCT AAA GAT GGC ACC GTT 1548 
Lys ^sn Asp Pro Asp Lys Glu Tyr Asn Met Pro Lys Asp Gly Thr Val 
405 410 415 

25 CAT GAG CTC ACA AGC AAT GCC ATC CTG TTC CTA CAG CAG CTT CTG GAC 1596 
His Glu Leu Thr Ser Asn Ala He Leu Phe Leu Gin Gin Leu Leu Asp 
420 425 430 

TTC CAG GAG ACA GCA GGC GCC ATG CTG GCC TCC CAA GAA ACC AGC TCT 1644 
30 Phe Gin Glu Thr Ala Gly Ala Met Leu Ala Ser Gin Glu Thr Ser Ser 
435 440 445 

TCG GCC ACC AGC TAC AAC TCC GAG TTC AGC AAG CGA CTG CTG AGT ACC 1692 
Ser Ala Thr Ser Tyr Asn Ser Glu Phe Ser Lys Arg Leu Leu Ser Thr 
35 450 455 460 j 465 

TAC ATT TGC AAA GTC CTG GGT AAC CTG CAG CTG AAC TTG CTA AGC AAG 1740 
Tyr He Cys Lys Val Leu Gly Asn Leu Gin Leu Asn Leu Leu Ser Lys 
470 475 480 



40 



TCC AAG GTG TAT GAG GAT CCA GCT CTG AGC GCC ATC TTC CTA CAC AAC 1788 

Ser Lys Val Tyr Glu Asp Pro Ala Leu Ser Ala He Phe Leu His Asn 
485 490 495 



45 
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AAC TAC AAC TAC ATC CTC AAG TCC CTG GAG AAG TCT GAG CTG ATC CAG 1836 
Asn Tyr Asn Tyr lie Leu Lys Ser Leu Glu Lys Ser Glu Leu lie Gin 
500 505 510 

5 CTT GTG GCT GTG ACC CAG AAG ACT GCT GAG CGC TCC TAC CGG GAG CAC 1884 
Leu Val Ala Val Thr Gin Lys Thr Ala Glu Arg Ser Tyr Arg Glu His 
515 520 525 

ATT GAG CAG CAG ATC CAG ACC TAC CAG CGC AGC TGG CTA AAG GTG ACT 1932 
10 He Glu Gin Gin He Gin Thr Tyr Gin Arg Ser Trp Leu Lys Val Thr 
530 535 540 545 

GAC TAC ATC GCA GAG AAG AAT CTA CCT GTG TTC CAG CCT GGA GTC AAG 1980 
Asp Tyr He Ala Glu Lys Asn Leu Pro Val Phe Gin Pro Gly Val Lys 
15 * 550 555 560 

CTC CGG GAC AAG GAG CGG CAA ATG ATT AAG GAA CGT TTC AAG GGA TTC 2028 
Leu Arg Asp Lys Glu Arg Gin Met He Lys Glu Arg Phe Lys Gly Phe 
565 570 575 

AAT GAT GGG CTC GAA GAA CTG TGC AAG ATT CAG AAG GCC TGG GCC ATC 2076 
Asn Asp Gly Leu Glu Glu Leu Cys Lys He Gin Lys Ala Trp Ala He 
580 585 590 

25 CCA GAC ACG GAG CAG AGG GAC AAG ATC CGA CAG GCT CAG AAA AGC ATC 2124 
Pro Asp Thr Glu Gin Arg Asp Lys He Arg Gin Ala Gin Lys Ser He 
595 600 605 

GTC AAG GAG ACC TAT GGG GCC TTT CTG CAC AGG TAT AGC AGC GTG CCC 2172 
30 Val Lys Glu Thr Tyr Gly Ala Phe Leu His Arg Tyr Ser Ser Val Pro 
610 615 620 625 

TTC ACC AAA AAC CCG GAG AAG TAC ATC AAG TAC CGT GTG GAG CAG GTG 2220 
Phe Thr Lys Asn Pro Glu Lys Tyr He Lys Tyr Arg Val Glu Gin Val 
35 630 635 640 

GGC GAC ATG ATC GAC CGC CTC TTC GAC ACC TCT GCT TAAGTCGGCC AGCGCT 2272 

Gly Asp Met He Asp Arg Leu Phe Asp Thr Ser Ala 
40 645 650 

TCCGCCCAGT GCCACTGGCG TGCCTAGCAT GTCATCGGAC TCATAAACCA GTGGTA 2328 



45 (2) INFORMATION FOR SEQ ID NO: 56: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear . 

5 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

10 Met He Pro Pro Gin Glu Ala Ser Ala Arg Arg tog Glu He Glu Asp 
! 5 10 15 

Lys Leu Lys Gin Glu Glu Glu Thr Leu Ser Phe He Arg Asp Ser Leu 

20 25 30 

Glu Lys Ser Asp Gin Leu Thr Lys Asn Met Val Ser He Leu Ser Ser 
15 35 40 45 

Phe Glu Ser Arg Leu Met Lys Leu Glu Asn Ser lie He Pro Val His 

50 55 60 

Lys Gin Thr Glu Asn Leu Gin Arg Leu Gin Glu Asn Val Glu Lys Thr 
65 70 75 80 

20 Leu Ser Cys Leu Asp His Val He Ser Tyr Tyr His Val Ala Ser Asp 

85 90 95 

Thr Glu Lys He He Arg Glu Gly Pro Thr Gly Arg Leu Olu Glu Tyr 

100 105 HO 

Leu Gly Ser Met Ala Lys He Gin Lys Ala Val Glu Tyr Phe Gin Asp 
25 115 120 125 

Asn Ser Pro Asp Ser Pro Glu Leu Asn Lys Val Lys Leu Leu Phe Glu 

130 135 140 

Arg Gly Lys Glu Ser Leu Glu Ser Glu Phe Arg Ser Leu Met Thr Arg 
145 150 155 160 

30 His Ser Lys Val He Ser Pro Val Leu Val Leu Asp Leu He Ser Ala 

165 170 175 

Asp Asp Glu Leu Glu Val Gin Glu Asp Val Val Leu Glu His Leu Pro 

180 185 190 

Glu Ser Val Leu Gin Asp Val He Arg He Ser Arg Trp Leu Val Glu 
35 195 200 205 

Tyr Gly Arg Asn Gin Asp Phe Met Asn Val Tyr Tyr Gin He Arg Ser 

210 215 220 

Ser Gin Leu Asp Arg Ser He Lys Gly Leu Lys Glu His Phe Arg Lys 
225 230 235 240 

40 Ser Ser Ser Ser Ser Gly Val Pro Tyr Ser Pro Ala He Pro Asn Lys 

245 250 255 

Arg Lys Asp Thr Pro Thr Lys Lys Pro He Lys Arg Pro Gly Arg Asp 

260 265 270 

Asp Met Leu Asp Val Glu Thr Asp Ala Tyr He His Cys Val Ser Ala 
45 275 280 285 



WO 98/28419 



PCT/US97/23498 



93 

Phe val Arg Leu Ala Gin Ser Glu Tyr Gin Leu Leu Met Gly He He 

290 295 300 

Pro Glu His His Gin Lys Lys Thr Phe Asp Ser Leu He Gin Asp Ala 
305 310 315 320 

5 Leu Asp Gly Leu Met Leu Glu Gly Glu Asn He Val Ser Ala Ala Arg 

325 330 335 

Lys Ala He He Arg His Asp Phe Ser Thr Val Leu Thr Val Phe Pro 

340 345 350 

He Leu Arg His Leu Lys Gin Thr Lys Pro Glu Phe Asp Gin Val Leu 
10 355 360 365 

Gin Gly Thr Ala Ala Ser Thr Lys Asn Lys Leu Pro Gly Leu He Thr 

370 375 380 

Ser Met Glu Thr He Gly Ala Lys Ala Leu Glu Asp Phe Ala Asp Asn 
385 390 395 400 

15 He Lys Asn Asp Pro Asp Lys Glu Tyr Asn Met Pro Lys Asp Gly Thr 

405 410 415 

Val His Glu Leu Thr Ser Asn Ala He Leu Phe Leu Gin Gin Leu Leu 

420 425 430 

Asp Phe Gin Glu Thr Ala Gly Ala Met Leu Ala Ser Gin Glu Thr Ser 
20 435 440 445 



Ser Ser Ala Thr Ser Tyr Asn Ser Glu Phe Ser Lys Arg Leu Leu Ser 

450 455 460 

Thr Tyr He Cys Lys Val Leu Gly Asn Leu Gin Leu Asn Leu Leu Ser 
25 465 470 475 480 

Lys Ser Lys Val Tyr Glu Asp Pro Ala Leu Ser Ala He Phe Leu His 

485 490 495 

Asn Asn Tyr Asn Tyr He Leu Lys Ser Leu Glu Lys Ser Glu Leu He 
500 505 510 

30 Gin Leu Val Ala Val Thr Gin Lys Thr Ala Glu Arg Ser Tyr Arg Glu 
515 520 525 

His He Glu Gin Gin He Gin Thr Tyr Gin Arg Ser Trp Leu Lys Val 

530 535 540 

Thr Asp Tyr He Ala Glu Lys Asn Leu Pro Val Phe Gin Pro Gly Val 
35 545 550 555 560 

Lys Leu Arg Asp Lys Glu Arg Gin Met He Lys Glu Arg Phe Lys Gly 

565 570 575 

Phe Asn Asp Gly Leu Glu Glu Leu Cys Lys He Gin Lys Ala Trp Ala 
580 585 590 

40 He Pro Asp Thr Glu Gin Arg Asp Lys He Arg Gin Ala Gin Lys Ser 
595 600 605 

He Val Lys Glu Thr Tyr Gly Ala Phe Leu His Arg Tyr Ser Ser Val 

610 615 620 

Pro Phe Thr Lys Asn Pro Glu Lys Tyr He Lys Tyr Arg Val Glu Gin 
45 625 630 635 640 
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Val Gly Asp Met lie Asp Arg Leu Phe Asp Thr Ser Ala 
645 650 



5 



15 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
<iv) ANTI- SENSE : NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p84 (rexo84) 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 156... 2303 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



GGCACGAGGC GCAATGCAGC GCTCGGCCCC GGCGCAGGCG TTTGGGTGTT GCTCCCGCCC 
60 

30 CGCTTCCTCT ACCGCTTCTC CACCGGGCTC GGGTGGGGCC GGGAGGCGGC GAGTGACAGC 120 
CACTGCGGGC CACGGGAGCA GGGCGGGTGG CAGCC ATG TCG GAC AGC GGG GCG 173 

Met Ser Asp Ser Gly Ala 
1 5 

35 AGC CGC CTG CGG AGG CAG CTG GAG TCG GGG GGC TTC GAG GCG CGG CTG 221 
Ser Arg Leu Arg Arg Gin Leu Glu Ser Gly Gly Phe Glu Ala Arg Leu 
10 15 20 

TAC GTG AAG CAA CTG TCG CAG CAG TCG GAC GGC GAC CGC GAC CTG CAG 269 
40 Tyr Val Lys Gin Leu Ser Gin Gin Ser Asp Gly Asp Arg Asp Leu Gin 
25 30 35 



45 



GAG CAC CGA CAG CGG GTG CAG GCG CTG GCG GAG GAG ACG GCG CAG AAC 
Glu His Arg Gin Arg Val Gin Ala Leu Ala Glu Glu Thr Ala Gin Asn 
40 45 50 



317 
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CTG AAG CGC AAC GTC TAC CAG AAC TAC CGG CAG TTC ATC GAG ACG GCG 365 
Leu Lys Arg Asn Val Tyr Gin Asn Tyr Arg Gin Phe He Glu Thr Ala 
55 60 65 70 



5 CGC GAG ATC TCC TAC CTG GAG AGC GAG ATG TAC CAG CTG AGC CAC CTG 413 
Arg Glu He Ser Tyr Leu Glu Ser Glu Met Tyr Gin Leu Ser His Leu 
75 80 85 



CTG ACG GAG CAG AAG AGC AGT CTG GAG AGC ATC CCG CTG GCA CTG CTG 
10 Leu Thr Glu Gin Lys Ser Ser Leu Glu Ser He Pro Leu Ala Leu Leu 
90 95 100 



461 



15 



CCC GCC GCT GCC GCG GGC GCA TCC ACG GGT GAG GAC ACG GCG GGC GCG 
Pro Ala Ala Ala Ala Gly Ala Ser Thr Gly Glu Asp Thr Ala Gly Ala 
105 HO 115 



509 



20 



GGA CCG CGG GAG CGC GGG GCG GCC CAG GCG GGC TTT CTT CCG GGG CCG 557 
Gly Pro Arg Glu Arg Gly Ala Ala Gin Ala Gly Phe Leu Pro Gly Pro 
120 125 130 

GCC GGA GTC CCT CGC GAG GGC CCC GGG ACG GGA GAG GAG GGC AAG CAG 605 
Ala Gly Val Pro Arg Glu Gly Pro Gly Thr Gly Glu Glu Gly Lys Gin 
135 140 145 150 



25 CGA ACG CTC ACC ACG CTG CTG GAG AAG GTG GAG GGC TGC AGG GAC CTG 
Arg Thr Leu Thr Thr Leu Leu Glu Lys Val Glu Gly Cys Arg Asp Leu 
155 160 165 



653 



CTG GAG ACA CCA GGT CAG TAC CTT GTG TAC AAC GGG GAC CTG GTG GAA 701 
30 Leu Glu Thr Pro Gly Gin Tyr Leu Val Tyr Asn Gly Asp Leu Val Glu 
170 175 180 



TAC GAG GCA GAC CAT ATG GCC CAG CTG CAG CGA GTG CAT GGC TTC CTC 749 
Tyr Glu Ala Asp His Met Ala Gin Leu Gin Arg Val His Gly Phe Leu 
35 185 190 195 

ATG AAC GAC TGT CTG CTG GTG GCC ACT TGG CTA CCA CAA CGG CGA GGC 797 
Met Asn Asp Cys Leu Leu Val Ala Thr Trp Leu Pro Gin Arg Arg Gly 
200 205 210 

40 

ATG TAC CGC TAC AAC GCA CTC TAC CCA CTG GAC CGC CTG GCT GTG GTC 845 
Met Tyr Arg Tyr Asn Ala Leu Tyr Pro Leu Asp Arg Leu Ala Val Val 
215 220 225 230 

45 AAC GTC AAA GAC AAT CCA CCT ATG AAA GAC ATG TTC AAG CTG CTC ATG 893 
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Asn Val Lys Asp Asn Pro Pro Met Lys Asp Met Phe Lys Leu Leu Met 
235 240 245 

TTC CCT GAG AGC CGA ATC TTT CAA GCG GAA AAT GCC AAG ATT AAA CGC 941 
5 Phe Pro Glu Ser Arg He Phe Gin Ala Glu Asn Ala Lys He Lys Arg 
250 255 260 

GAG TGG CTG GAA GTG TTG GAG GAA ACC AAG AGG GCG CTC AGC GAC AAG 989 

10 Glu Trp Leu Glu Val Leu Glu Glu Thr Lys Arg Ala Leu Ser Asp Lys 
265 270 275 

AGG AGA CGG GAG CAG GAG GAG GCA GCC GCC TTG CGT GCG CCA CCA CCG 1037 
Arg Arg Arg Glu Gin Glu Glu Ala Ala Ala Leu Arg Ala Pro Pro Pro 
15 280 285 290 

GTC ACT TCC AAG GGC AGC AAC CCG TTT GAG GAT GAG GCC GAG GAG GAA 1085 
Val Thr Ser Lys Gly Ser Asn Pro Phe Glu Asp Glu Ala Glu Glu Glu 
295 300 305 310 

20 

CTG GCC ACC CCG GAG GCA GAG GAG GAA AAG GTT GAC CTT TCC ATG GAG 1133 
Leu Ala Thr Pro Glu Ala Glu Glu Glu Lys Val Asp Leu Ser Met Glu 
315 320 325 

25 TGG ATC CAG GAG TTG CCG GAA GAC CTG GAT GTT TGT ATT GCG CAG AGG 1181 
Trp He Gin Glu Leu Pro Glu Asp Leu Asp Val Cys He Ala Gin Arg 
330 335 340 

GAC TTT GAG GGT GCC GTG GAC TTG CTG GAC AAA TTA AAT CAC TAT CTT 1229 
30 Asp Phe Glu Gly Ala Val Asp Leu Leu Asp Lys Leu Asn His Tyr Leu 
345 350 355 

GAA GAT AAG CCC AGC CCA CCT TCT GTG AAA GAG CTG AGG GCC AAA GTG 1277 
Glu Asp Lys Pro Ser Pro Pro Ser Val Lys Glu Leu Arg Ala Lys Val 
35 360 365 370 

GAT GAA CGA GTG CGA CAG CTC ACC GAG GTG CTT GTG TTT GAG CTC TCC 1325 
Asp Glu Arg Val Arg Gin Leu Thr Glu Val Leu Val Phe Glu Leu Ser 
375 380 385 390 

40 

CCG GAT CGC TCT CTG AGA GGT GGC CCT AAG GCT ACT CGA AGG GCA GTG 1373 
Pro Asp Arg Ser Leu Arg Gly Gly Pro Lys Ala Thr Arg Arg Ala Val 
395 400 405 

45 TCT CAA CTG ATC CGT CTC GGC CAG TGC ACT AAG GCT TGT GAG CTG TTT 1421 
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Ser Gin Leu He Arg Leu Gly Gin Cys Thr Lys Ala Cys Glu Leu Phe 
410 415 420 

CTG AGG AAC AGG GCG GCA GCT GTG CAT ACT GCC ATC CGC CAG CTT CGA 1469 

5 

Leu Arg Asn Arg Ala Ala Ala Val His Thr Ala He Arg Gin Leu Arg 
425 430 435 

ATT GAG GGC GCC ACT CTG CTC TAT ATT CAC AAG CTG TGC CAT GTC TTC 1517 
10 He Glu Gly Ala Thr Leu Leu Tyr He His Lys Leu Cys His Val Phe 
440 445 450 

TTT ACC AGC CTC CTA GAG ACT GCA CGG GAG TTT GAG ACA GAC TTT GCA 1565 
Phe Thr Ser Leu Leu Glu Thr Ala Arg Glu Phe Glu Thr Asp Phe Ala 
15 455 460 465 470 

GGC ACG GAC AGT GGC TGC TAC TCT GCC TTT GTG GTC TGG GCA AGG TCT 1613 
Gly Thr Asp Ser Gly Cys Tyr Ser Ala Phe Val Val Trp Ala Arg Ser 
475 480 485 

20 

GCC ATG GGC ATG TTC GTG GAT GCT TTC AGC AAG CAG GTT TTT GAC AGC 1661 
Ala Met Gly Met Phe Val Asp Ala Phe Ser Lys Gin Val Phe Asp Ser 
490 495 500 

25 AAG GAG AGC CTG TCC ACT QCT GCC GAG TGT GTG AAG GTA GCC AAG GAG 1709 
Lys Glu Ser Leu Ser Thr Ala Ala Glu Cys Val Lys Val Ala Lys Glu 
505 510 515 

CAC TGC CAG CAG CTG GGA GAG ATT GGG CTG GAT CTC ACC TTC ATC ATC 1757 
30 His Cys Gin Gin Leu Gly Glu He Gly Leu Asp Leu Thr Phe He He 
520 525 530 

CAT GCC CTC CTG GTG AAG GAC ATC CAG GGG GCC TTG CTC AGT TAC AAG 1805 
His Ala Leu Leu Val Lys Asp He Gin Gly Ala Leu Leu Ser Tyr Lys 
35 535 540 545 550 

GAG ATT ATC ATT GAA GCC ACC AAG CAC CGA AAC TCG GAG GAG ATG TGG 1853 
Glu He He He Glu Ala Thr Lys His Arg Asn Ser Glu Glu Met Trp 
555 560 565 

40 

CGT CGG ATG AAC CTG ATG ACT CCT GAG GCC CTG GGC AAG CTC AAG GAG 1901 
Arg Arg Met Asn Leu Met Thr Pro Glu Ala Leu Gly Lys Leu Lys Glu 
570 575 580 

45 GAG ATG AGA AGT TGC GGG GTC AGT AAC TTT GAG CAG TAT ACG GGG GAT 1949 
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20 



Glu Met Arg Ser Cys Gly Val Ser Asn Phe Glu Gin Tyr Thr Gly Asp 
585 590 595 

5 GAC TGC TGG GTG AAC CTG AGC TAC ACT GTG GTA GCC TTC ACC AAG CAG 1997 
Asp Cys Trp Val Asn Leu Ser Tyr Thr Val Val Ala Phe Thr Lys Gin 
600 605 610 

ACC ATG GGC TTC CTG GAG GAG GCA CTG AAA CTG TAC TTC CCA GAG CTA 2045 
10 Thr Met Gly Phe Leu Glu Glu Ala Leu Lys Leu Tyr Phe Pro Glu Leu 
615 620 625 630 

CAC ATG GTG CTC CTG GAG AGT CTG GTG GAA GTC ATA CTG GTT GCT GTG 2093 
His Met Val Leu Leu Glu Ser Leu Val Glu Val He Leu Val Ala Val 
15 635 640 645 

CAG CAT GTG GAC TAC AGC CTG CGC TGT GAG CAG GAC CCA GAG AAG AAG 2141 
Gin His Val Asp Tyr Ser Leu Arg Cys Glu Gin Asp Pro Glu Lys Lys 
650 655 660 

ACT TTC ATC CGG CAG AAC GCA TCC TTC CTT TAC GAC ACT GTC CTC CCA 2189 
Thr Phe He Arg Gin Asn Ala Ser Phe Leu Tyr Asp Thr Val Leu Pro 
665 670 675 

25 GTG GTG GAG AGG AGG TTT GAG GAA GGT GTA GGG AAG CCT GCC AAG CAG 2237 
Val Val Glu Arg Arg Phe Glu Glu Gly Val Gly Lys Pro Ala Lys Gin 
680 685 690 

CTG CAG GAC CTT CGG AAT GCA TCG AGA CTC CTG CGG GTG AAT CCT GAG 2285 
30 Leu Gin Asp Leu Arg Asn Ala Ser Arg Leu Leu Arg Val Asn Pro Glu 
695 700 70S 710 

AGC ACC ACG TCT GTG GTC TGAGGCTCTG GTTCGATGCT GTGTGTGCGT CTGTCTGT 2341 
Ser Thr Thr Ser Val Val 
35 715 

CTGTCTGTCT GTATGTATGT ATGTATGTAT GTATGTCTGT ACACACACAT GTACTACAGA 2401 
TGCATTGTTT GTATTCACAT TACACAATAT CAACACATTC CTCAGACACA AACAGCATAT 2461 
AAAAGTGCCC TCTTAAAAAT GCAGTCAAAA AAAAA 2496 

40 

(2) INFORMATION FOR SEQ ID NO: 58: 



45 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 716 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Met Ser Asp Ser Gly Ala Ser Arg Leu Arg Arg Gin Leu Glu Ser Gly 
15 10 15 

10 Gly Phe Glu Ala Arg Leu Tyr Val Lys Gin Leu Ser Gin Gin Ser Asp 
20 25 30 

Gly Asp Arg Asp Leu Gin Glu His Arg Gin Arg Val Gin Ala Leu Ala 

35 40 45 

Glu Glu Thr Ala Gin Asn Leu Lys Arg Asn Val Tyr Gin Asn Tyr Arg 
15 50 55 60 

Gin Phe He Glu Thr Ala Arg Glu He Ser Tyr Leu Glu Ser Glu Met 
65 70 75 80 

Tyr Gin Leu Ser His Leu Leu Thr Glu Gin Lys Ser Ser Leu Glu Ser 
85 90 95 

20 He Pro Leu Ala Leu Leu Pro Ala Ala Ala Ala Gly Ala Ser Thr Gly 
100 105 110 

Glu Asp Thr Ala Gly Ala Gly Pro Arg Glu Arg Gly Ala Ala Gin Ala 

115 120 125 

Gly Phe Leu Pro Gly Pro Ala Gly Val Pro Arg Glu Gly Pro Gly Thr 
25 130 135 140 

Gly Glu Glu Gly Lys Gin Arg Thr Leu Thr Thr Leu Leu Glu Lys Val 
145 150 155 160 

Glu Gly Cys Arg Asp Leu Leu Glu Thr Pro Gly Gin Tyr Leu Val Tyr 
165 170 175 

30 Asn Gly Asp Leu Val Glu Tyr Glu Ala Asp His Met Ala Gin Leu Gin 
180 185 190 

Arg Val His Gly Phe Leu Met Asn Asp Cys Leu Leu Val Ala Thr Trp 

195 200 205 

Leu Pro Gin Arg Arg Gly Met Tyr Arg Tyr Asn Ala Leu Tyr Pro Leu 
35 210 215 220 , 

Asp Arg Leu Ala Val Val Asn Val Lys Asp Asn Pro Pro Met Lys Asp 
225 230 235 240 

Met Phe Lys Leu Leu Met Phe Pro Glu Ser Arg He Phe Gin Ala Glu 
245 250 255 

40 Asn Ala Lys He Lys Arg Glu Trp Leu Glu Val Leu Glu Glu Thr Lys 
260 265 270 

Arg Ala Leu Ser Asp Lys Arg Arg Arg Glu Gin Glu Glu Ala Ala Ala 

275 280 285 

Leu Arg Ala Pro Pro Pro Val Thr Ser Lys Gly Ser Asn Pro Phe Glu 
45 290 295 300 
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Asp Glu Ala Glu Glu Glu Leu Ala Thr Pro Glu Ala Glu Glu Glu Lys 
305 310 315 320 

Val Asp Leu Ser Met Glu Trp lie Gin Glu Leu Pro Glu Asp Leu Asp 
325 330 335 

5 Val Cys He Ala Gin Arg Asp Phe Glu Gly Ala Val Asp Leu Leu Asp 
340 345 350 

Lys Leu Asn His Tyr Leu Glu Asp Lys Pro Ser Pro Pro Ser Val Lys 

355 360 365 

Glu Leu Arg Ala Lys Val Asp Glu Arg Val Arg Gin Leu Thr Glu Val 
10 370 375 380 

Leu Val Phe Glu Leu Ser Pro Asp Arg Ser Leu Arg Gly Gly Pro Lys 
385 390 395 400 

Ala Thr Arg Arg Ala Val Ser Gin Leu He Arg Leu Gly Gin Cys Thr 
405 410 415 

15 Lys Ala Cys Glu Leu Phe Leu Arg Asn Arg Ala Ala Ala Val His Thr 
420 425 430 

Ala He Arg Gin Leu Arg He Glu Gly Ala Thr Leu Leu Tyr lie His 

435 440 445 

Lys Leu Cys His Val Phe Phe Thr Ser Leu Leu Glu Thr Ala Arg Glu 
20 450 455 460 

Phe Glu Thr Asp Phe Ala Gly Thr Asp Ser Gly Cys Tyr Ser Ala Phe 
465 470 475 480 

Val Val Trp Ala Arg Ser Ala Met Gly Met Phe Val Asp Ala Phe Ser 
485 490 495 

25 Lys Gin Val Phe Asp Ser Lys Glu Ser Leu Ser Thr Ala Ala Glu Cys 
500 505 510 

Val Lys Val Ala Lys Glu His Cys Gin Gin Leu Gly Glu He Gly Leu 
515 520 525 

30 Asp Leu Thr Phe He He His Ala Leu Leu Val Lys Asp He Gin Gly 
530 535 540 

Ala Leu Leu Ser Tyr Lys Glu He He He Glu Ala Thr Lys His Arg 
545 550 555 560 

Asn Ser Glu Glu Met Trp Arg Arg Met Asn Leu Met Thr Pro Glu Ala 
35 565 570 575 

Leu Gly Lys Leu Lys Glu Glu Met Arg Ser Cys Gly Val Ser Asn Phe 

580 585 590 

Glu Gin Tyr Thr Gly Asp Asp Cys Trp Val Asn Leu Ser Tyr Thr Val 
595 600 605 

40 Val Ala Phe Thr Lys Gin Thr Met Gly Phe Leu Glu Glu Ala Leu Lys 
610 615 620 

Leu Tyr Phe Pro Glu Leu His Met Val Leu Leu Glu Ser Leu Val Glu 
625 630 635 640 

Val He Leu Val Ala Val Gin His Val Asp Tyr Ser Leu Arg Cys Glu 
45 645 650 655 
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Gin Asp Pro Glu Lys Lys Thr Phe He Arg Gin Asn Ala Ser Phe Leu 

660 665 670 

Tyr Asp Thr Val Leu Pro Val Val Glu Arg Arg Phe Glu Glu Gly Val 
675 680 685 

5 Gly Lys Pro Ala Lys Gin Leu Gin Asp Leu Arg Asn Ala Ser Arg Leu 
690 695 700 

Leu Arg Val Asn Pro Glu Ser Thr Thr Ser Val Val 
705 710 715 



10 



20 



(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3059 base pairs 
!5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



25 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p96 (rsecl5) 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
30 (B) LOCATION: 341.. -2806 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

35 CCGCGTCGAC CCGGCACAAA CACACCATGC ACCCTGACAG TAAATTTATA AAAAATTTTT 
60 

TTCAACTATA GCACAAAAAA TGTGTATCAC TTAACAGGGA TGACAAATGG CTGTCACTTC 120 
AGTCACTATA ATGCACAATA TATATATGCC TTCTACTTAT GAAGCACGTT TACTGGAAAA 180 
CAACTGTTAT GCAGACGGCA TCCTCATTTT TCTTATGTTT GGTCTCTTGT CCCTTGAGAG 240 
40 ATCATGTCCA GTAAGGGATC TACAACAGTT CCTTGTGTGT GTTAACAATG ATGAAATCAC 300 
AATTGATGTG TTTTCCTGAG TCTGTCCCCA CAGTTAAACA ATG CCT TTC TCC CCG 355 

Met Pro Phe Ser Pro 
1 5 

45 GGG CGC CGC GCC CGG CCG GCG CCT CCG CTT CCA GCC AAG ATG GCG GAG 403 



102 

Gly Arg Arg Ala Arg Pro Ala Pro Pro Leu Pro Ala Lys Met Ala Glu 
10 15 20 

AGC GGC GAG GCT CTG GGT ACG GTC CCG GAG CAC GAG CGG ATC TTG CAG 
Ser Gly Glu Ala Leu Gly Thr Val Pro Glu His Glu Arg He Leu Gin 
25 30 35 

GAG ATC GAA AGC ACG GAC ACC GCC TGC GTA GGG CCC ACC CTT CGG TCT 
Glu He Glu Ser Thr Asp Thr Ala Cys Val Gly Pro Thr Leu Arg Ser 
40 45 50 

GTG TAT GAT GGT CAG CCA AAT GCA CAC AAG AAG TTC ATG GAA AAG CTA 
Val Tyr Asp Gly Gin Pro Asn Ala His Lys Lys Phe Met Glu Lys Leu 
55 60 65 

GAT GCC TGC ATC CGC AAC CAT GAC AAG GAG ATT GAG AAA ATG TGT AAC 
Asp Ala Cys He Arg Asn His Asp Lys Glu He Glu Lys Met Cys Asn 
70 75 80 85 

TTC CAT CAC CAG GGC TTT GTA GAT GCT ATC ACA GAG CTC CTC AAA GTA 
Phe His His Gin Gly Phe Val Asp Ala He Thr Glu Leu Leu Lys Val 
90 95 100 

AGG GCG GAC GCA GAA AAG CTG AAG GTC CAG GTT ACT GAT ACC AAC CGA 
Arg Ala Asp Ala Glu Lys Leu Lys Val Gin Val Thr Asp Thr Asn Arg 
105 HO H5 

AGG TTC CAA GAT GCT GGA AAA GAG GTG ATA GAA CAA ACA GAA GAT ATT 
Arg Phe Gin Asp Ala Gly Lys Glu Val He Glu Gin Thr Glu Asp He 
120 125 130 

ATT CGG TGT AGA ATT CAG CAG AGA AAT ATT ACA ACC GTA GTA GAG AAA 
He Arg Cys Arg He Gin Gin Arg Asn He Thr Thr Val Val Glu Lys 
135 140 145 

j 

TTG CAG TTA TGC CTG CCA GTG TTG GAG ATG TAC AGT AAA CTG AAG GAA 
Leu Gin Leu Cys Leu Pro Val Leu Glu Met Tyr Ser Lys Leu Lys Glu 
150 155 160 165 



CAG ATG AGC ATG CAG AGG TAT TAT TCT GCA CTT AAA ACA ATG GAA CAG 
Gin Met Ser Met Gin Arg Tyr Tyr Ser Ala Leu Lys Thr Met Glu Gin 
170 175 180 



TTA GAG AAC GTA TAC TTT CCT CGG GTT AGC CAG TAC CGC TTC TGT CAG 
Leu Glu Asn Val Tyr Phe Pro Arg Val Ser Gin Tyr Arg Phe Cys Gin 
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185 190 195 

CTC ATG ATG GAC ACA CTT CCT AAA CTC CGG GAG GAT ATC AAA GAC ATC 979 
Leu Met Met Asp Thr Leu Pro Lys Leu Arg Glu Asp lie Lys Asp He 
5 200 205 210 

TCC ATG TCT GAC CTC AAG GAC TTT CTG GAA AGC ATT CGA AAG CAT TCT 1027 
Ser Met Ser Asp Leu Lys Asp Phe Leu Glu Ser He Arg Lys His Ser 
215 220 225 

10 

GAC AAA ATA GGG GAA ACA GCC ATG AAA CAG GCA CAA CAG CAG AAG AGC 1075 
Asp Lys He Gly Glu Thr Ala Met Lys Gin Ala Gin Gin Gin Lys Ser 
230 235 240 245 

15 TTC AGT ATT GCT GTA CAG AAG CAA ACC AAT ATG AGG TTT GGG AAA AAT 1123 

Phe Ser He Ala Val Gin Lys Gin Thr Asn Met Arg Phe Gly Lys Asn 

250 255 260 

20 ATG CAT GTA AAC AAT GAC AGG ACT CTA GAG GAA AAG AGT GAC ATC ATA 1171 
Met His Val Asn Asn Asp Arg Thr Leu Glu Glu Lys Ser Asp He He 
265 270 275 

CTG AAG CAC ACA CTT GAG GAG GAG GCC GAG AAT GAC GAG GAG GTC TTA 1219 
25 Leu Lys His Thr Leu Glu Glu Glu Ala Glu Asn Asp Glu Glu Val Leu 
280 285 290 

ACT GTT CAA GAT CTC GTT GAC TTT TCC CCT GTT TAC CGA TGC TCA CAC 1267 
Thr Val Gin Asp Leu Val Asp Phe Ser Pro Val Tyr Arg Cys Ser His 
30 295 300 305 

ATT TAT TCT GCT TTG GGT GAT GAA GAA ACA TTT GAA AAT TAT TAC CGG 1315 
He Tyr Ser Ala Leu Gly Asp Glu Glu Thr Phe Glu Asn Tyr Tyr Arg 
310 315 320 325 



35 



40 



AAG CAA AGG AAG AAG CAG GCG CGC CTG GTT CTG CAG CCG CAG TCG AGT 1363 
Lys Gin Arg Lys Lys Gin Ala Arg Leu Val Leu Gin Pro Gin Ser Ser 
330 335 340 

GTG CAT GAA ACA GTT GAT GGC TAT AGA AGA TAT TTC ACT CAG ATT GTA 1411 
Val His Glu Thr Val Asp Gly Tyr Arg Arg Tyr Phe Thr Gin He Val 
345 350 355 



45 



GGG TTC TTT GTT GTG GAA GAT CAC ATT TTA CAT GTG ACC CAA GGA TTA 
Gly Phe Phe Val Val Glu Asp His He Leu His Val Thr Gin Gly Le 



1459 
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535 540 545 

TGC TTA CTG AAC CTC ATT AGA AAA CCT CAC ATA GGT TTG ACA GAG CTG 2035 
Cys Leu Leu Asn Leu He Arg Lys Pro His He Gly Leu Thr Glu Leu 
5 550 555 560 565 

GTG CAA ATT ATC ATA AAC ACA ACA CAC CTG GAG CAG GCT TGC AAA TAC 2083 

Val Gin He He He Asn Thr Thr His Leu Glu Gin Ala Cys Lys Tyr 
10 570 575 580 

CTG GAG GAC TTT ATA ACT AAC ATT ACA AAT ATT TCT CAA GAA ACC GTT 2131 
Leu Glu Asp Phe He Thr Asn He Thr Asn He Ser Gin Glu Thr Val 
585 590 595 

CAC ACC ACA AGA CTT TAT GGA CTT TCT ACT TTT AAG GAT GCA CGA CAT 2179 
His Thr Thr Arg Leu Tyr Gly Leu Ser Thr Phe Lys Asp Ala Arg His 
600 605 610 

20 GCA GCG GAA GGA GAA ATA TAT ACC AAA CTC AAT CAG AAG ATT GAT GAG 2227 
Ala Ala Glu Gly Glu He Tyr Thr Lys Leu Asn Gin Lys He Asp Glu 
615 620 625 

TTT GTC CAG CTG GCC GAC TAC GAC TGG ACC ATG GCT GAG TCG GAT GGG 2275 
25 Phe Val Gin Leu Ala Asp Tyr Asp Trp Thr Met Ala Glu Ser Asp Gly 
630 635 640 645 

AGA GCA AGT GGT TAT TTA ATG GAT CTT ATT AAT TTT TTG AGA AGC ATC 2323 
Arg Ala Ser Gly Tyr Leu Met Asp Leu He Asn Phe Leu Arg Ser He 
30 650 655 660 

TTT CAA GTA TTT ACC CAC TTG CCT GGG AAA GTT GCC CAG ACA GCT TGC 2371 
Phe Gin Val Phe Thr His Leu Pro Gly Lys Val Ala Gin Thr Ala Cys 
665 670 675 

35 { 

ATG TCA GCC TGC CAA CAT CTG TCC ACA TCC TTG ATG CAG ATG CTC CTG 2419 
Met Ser Ala Cys Gin His Leu Ser Thr Ser Leu Met Gin Met Leu Leu 
680 685 690 

40 GAC AGC GAG TTA AAG CAG ATC AGC ATG GGG GCC GTA CAG CAG TTT AAC 2467 
Asp Ser Glu Leu Lys Gin He Ser Met Gly Ala Val Gin Gin Phe Asn 
695 700 70S 

CTA GAC GTC ATA CAG TGC GAA TTG TTT GCC AGC TCT GAG CCT GTG CCA 2515 
45 Leu Asp Val He Gin Cys Glu Leu Phe Ala Ser Ser Glu Pro Val Pro 
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710 715 720 725 

GGA TTC CAA GGG GAC ACC CTG CAG CTG GCC TTC ATT GAC CTC AGA CAG 2563 

5 Gly Phe Gin Gly Asp Thr Leu Gin Leu Ala Phe He Asp Leu Arg Gin 

730 735 740 

CTC CTT GAC TTG TTT ATG GTT TGG GAC TGG TCT ACT TAC CTG GCT GAC 2611 
Leu Leu Asp Leu Phe Met Val Trp Asp Trp Ser Thr Tyr Leu Ala Asp 
10 745 750 755 

TAT GGG CAG CCA GCT TCC AAG TAC CTC CGG GTG AAC CCA CAC GCC GCC 2659 
Tyr Gly Gin Pro Ala Ser Lys Tyr Leu Arg Val Asn Pro His Ala Ala 
760 765 770 

15 

CTT ACG CTT TTG GAG AAG ATG AAG GAC ACC AGT AAG AAG AAT AAC ATA 2707 
Leu Thr Leu Leu Glu Lys Met Lys Asp Thr Ser Lys Lys Asn Asn He 

775 780 785 

20 TTT GCC CAG TTC CGG AAG AAC GAC CGG GAC AGA CAG AAG CTG ATC GAG 2755 
Phe Ala Gin Phe Arg Lys Asn Asp Arg Asp Arg Gin Lys Leu He Glu 
790 795 800 805 

ACA GTG GTG AAA CAG CTC AGA GGC TTG GTG ACC GGT ATG TCC CAG CAC 2803 
25 Thr Val Val Lys Gin Leu Arg Gly Leu Val Thr Gly Met Ser Gin His 

810 815 820 

ATG TAGACCAGAT CCATGCCCGA GTGCTTCTCT GTGGCCAGCA CCGGTCATGA GACAAT 2862 
Met 



30 



35 



AATTTGTTTA CAGAAGCCAA AAACTGTAGT AGAGGAAAGA TACTAATGAG GGCTTAAACA 2922 

ATAAGGACAG CAAACATCAT TTGTATGGAC TTTTAATAAC TGAATACTAT TTTATATATA 2982 

AAAATGTAAC ATTAATTTTT TCCAATTTTA GAAGAAAAGT CAAAAGTGTA ATATATAAAC 3042 

CCAATAAATT GTGTATG 3059 



(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 822 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Met Pro Phe Ser Pro Gly Arg Arg Ala Arg Pro Ala Pro Pro Leu Pro 
15 10 15 

5 Ala Lys Met Ala Glu Ser Gly Glu Ala Leu Gly Thr Val Pro Glu His 
20 25 30 

Glu Arg lie Leu Gin Glu He Glu Ser Thr Asp Thr Ala Cys Val Gly 

35 40 45 

Pro Thr Leu Arg Ser Val Tyr Asp Gly Gin Pro Asn Ala His Lys Lys 
10 50 55 60 

Phe Met Glu Lys Leu Asp Ala Cys He Arg Asn His Asp Lys Glu He 
65 70 75 80 

Glu Lys Met Cys Asn Phe His His Gin Gly Phe Val Asp Ala He Thr 
85 90 95 

15 Glu Leu Leu Lys Val Arg Ala Asp Ala Glu Lys Leu Lys Val Gin Val 
100 105 HO 

Thr Asp Thr Asn Arg Arg Phe Gin Asp Ala Gly Lys Glu Val He Glu 

115 120 125 

Gin Thr Glu Asp He He Arg Cys Arg He Gin Gin Arg Asn He Thr 
20 130 135 140 

Thr Val Val Glu Lys Leu Gin Leu Cys Leu Pro Val Leu Glu Met Tyr 
145 150 155 160 

Ser Lys Leu Lys Glu Gin Met Ser Met Gin Arg Tyr Tyr Ser Ala Leu 
165 170 175 

25 Lys Thr Met Glu Gin Leu Glu Asn Val Tyr Phe Pro Arg Val Ser Gin 
180 185 190 

Tyr Arg Phe Cys Gin Leu Met Met Asp Thr Leu Pro Lys Leu Arg Glu 

195 200 205 

Asp He Lys Asp He Ser Met Ser Asp Leu Lys Asp Phe Leu Glu Ser 
30 210 215 220 

He Arg Lys His Ser Asp Lys He Gly Glu Thr Ala Met Lys Gin Ala 
225 230 235 240 

Gin Gin Gin Lys Ser Phe Ser He Ala Val Gin Lys Gin Thr Asn Met 
35 245 250 255 

Arg Phe Gly Lys Asn Met His Val Asn Asn Asp Arg Thr Leu Glu Glu 

260 265 270 

Lys Ser Asp He He Leu Lys His Thr Leu Glu Glu Glu Ala Glu Asn 
275 280 285 

40 Asp Glu Glu Val Leu Thr Val Gin Asp Leu Val Asp Phe Ser Pro Val 
290 295 300 

Tyr Arg Cys Ser His He Tyr Ser Ala Leu Gly Asp Glu Glu Thr Phe 
305 310 315 320 

Glu Asn Tyr Tyr Arg Lys Gin Arg Lys Lys Gin Ala Arg Leu Val Leu 
45 325 330 335 
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10 



15 



20 



25 



Gin Pro Gin Ser Ser Val His Glu Thr Val Asp Gly Tyr Arg Arg Tyr 

340 345 350 

Phe Thr Gin He Val Gly Phe Phe Val Val Glu Asp His He Leu His 

355 360 365 

Val Thr Gin Gly Leu Val Thr Arg Ala Tyr Thr Asp Glu Leu Trp Asn 

370 375 380 

Met Ala Leu Ser Lys He He Ala Val Leu Arg Ala His Ser Ser Tyr 
385 390 395 400 

Cys Thr Asp Pro Asp Leu Val Leu Glu Leu Lys Xaa Leu He Val He 

405 410 415 

Phe Ala Asp Thr Leu Gin Gly Tyr Gly Phe Ser Val Asn Arg Leu Phe 

420 425 430 

Asp Leu Leu Phe Glu He Arg Asp Gin Tyr Asn Glu Thr Leu Leu Lys 

435 440 445 

Lys Trp Ala Gly He Phe Arg Asp He Phe Glu Glu Asp Asn Tyr Ser 

450 455 460 

Pro He Pro He Gly Ser Glu Glu Glu Tyr Lys Met Val He Ser Lys 
465 470 475 480 

Phe Pro Phe Gin Asp Pro Asp Leu Glu Lys Gin Ser Phe Pro Lys Lys 

485 490 495 

Phe Pro Met Ser Gin Ser Val Pro Leu He Tyr He Gin Val Lys Glu 

500 505 510 

Phe He Tyr Ala Ser Leu Lys Phe Ser Glu Ser Leu His Arg Ser Ser 

515 520 525 

Thr Glu He Asp Asp Met Leu Arg Lys Ser Thr Asn Leu Leu Leu Thr 

530 535 540 

Arg He Leu Ser Ser Cys Leu Leu Asn Leu He Arg Lys Pro His He 
545 550 555 560 



30 



35 



40 



45 



Gly Leu Thr Glu Leu Val Gin He He He Asn Thr Thr His Leu Glu 

565 570 575 

Gin Ala Cys Lys Tyr Leu Glu Asp Phe He Thr Asn He Thr Asn He 

580 585 590 

Ser Gin Glu Thr Val His Thr Thr Arg Leu Tyr Gly Leu Ser Thr Phe 

595 600 605 

Lys Asp Ala Arg His Ala Ala Glu Gly Glu He Tyr Thr Lys Leu Asn 

610 615 620 

Gin Lys He Asp Glu Phe Val Gin Leu Ala Asp Tyr Asp Trp Thr Met 
625 630 635 640 

Ala Glu Ser Asp Gly Arg Ala Ser Gly Tyr Leu Met Asp Leu He Asn 

645 650 655 

Phe Leu Arg Ser He Phe Gin Val Phe Thr His Leu Pro Gly Lys Val 

660 665 670 

Ala Gin Thr Ala Cys Met Ser Ala Cys Gin His Leu Ser Thr Ser Leu 
675 680 685 
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Met Gin Met Leu Leu Asp Ser Glu Leu Lys Gin lie Ser Met Gly Ala 

690 695 700 

Val Gin Gin Phe Asn Leu Asp Val He Gin Cys Glu Leu Phe Ala Ser 
705 710 715 720 

Ser Glu Pro Val Pro Gly Phe Gin Gly Asp Thr Leu Gin Leu Ala Phe 

725 730 735 

He Asp Leu Arg Gin Leu Leu Asp Leu Phe Met Val Trp Asp Trp Ser 

740 745 750 

Thr Tyr Leu Ala Asp Tyr Gly Gin Pro Ala Ser Lys Tyr Leu Arg Val 

755 760 765 

Asn Pro His Ala Ala Leu Thr Leu Leu Glu Lys Met Lys Asp Thr Ser 

770 775 780 

Lys Lys Asn Asn He Phe Ala Gin Phe Arg Lys Asn Asp Arg Asp Arg 
785 790 795 800 

Gin Lys Leu He Glu Thr Val Val Lys Gin Leu Arg Gly Leu Val Thr 

805 810 815 

Gly Met Ser Gin His Met 



820 



20 



(2) INFORMATION FOR SEQ ID NO: 61: 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) 



MOLECULE TYPE: cDNA to mRNA 



30 



(iii) 



HYPOTHETICAL: NO 



(iv) 



ANTI- SENSE: NO 



35 



(vi) 



ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: p!02 (rsecS) 



40 



(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 200... 2971 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



45 
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ACTGCTGCGG CCCGGAGCGG GTTGGCCCGG ACCTGACGCC GCCTGCAAGC TGCAGGACAG 
60 

AGGGCTGGAG ATGGCAGATC CCAGGAGCTA GCAGCCAGCC CAGCCGAATG GTGTCTCCAC 120 
CTGGAGTGCA GGAGAGAGAG ACTGAAAACC AAAAAGGTTG CTTTTATAAC TGGAACAGGA 18 0 
5 GTTTGCTGCT CAAAGTACA ATG TCT CGA TCC CGG CAG CCT CCA CTT GTG ACA 232 

Met Ser Arg Ser Arg Gin Pro Pro Leu Val Thr 
15 10 

GGC ATC TCT CCA AAT GAA GGG ATT CCT TGG ACC AAA GTC ACA ATT AGA 280 
10 Gly lie Ser Pro Asn Glu Gly lie Pro Trp Thr Lys Val Thr He Arg 
15 20 25 

GGG GAG AAT CTG GGT ACT GGT CCC ACT GAC CTT ATA GGC TTG ACA ATT 328 
Gly Glu Asn Leu Gly Thr Gly Pro Thr Asp Leu He Gly Leu Thr He 
15 30 35 40 

TGT GGA CAT AAT TGC CTC CTC ACG GCA GAA TGG ATG TCT GCA AGT AAA 3 76 
Cys Gly His Asn Cys Leu Leu Thr Ala Glu Trp Met Ser Ala Ser Lys 
45 50 55 

20 

ATT GTC TGT CGA GTG GGA CAA GCC AAA AAT GAC AAA GGA GAC ATT ATT 424 
He Val Cys Arg Val Gly Gin Ala Lys Asn Asp Lys Gly Asp He He 
60 65 70 75 

25 GTC ACA ACC AAG TCA GGG GGC AGA GGA ACC TCA ACT GTC TCT TTT AAG 472 
Val Thr Thr Lys Ser Gly Gly Arg Gly Thr Ser Thr Val Ser Phe Lys 
80 85 90 

CTA CTC AAA CCT GAA AAA ATA GGC ATT TTG GAT CAG TCT GCA GTG TGG 520 
30 Leu Leu Lys Pro Glu Lys He Gly lie Leu Asp Gin Ser Ala Val Trp 
95 100 105 

GTG GAT GAG ATG AAT TAT TAT GAT ATG CGG ACT GAC AGG AAT AAG GGG 568 
Val Asp Glu Met Asn Tyr Tyr Asp Met Arg Thr Asp Arg Asn Lys Gly 
35 110 115 120 

ATT CCC CCC TTG TCA CTT CGT CCT GCG AAT CCT CTC GGC ATT GAA ATT 616 
He Pro Pro Leu Ser Leu Arg Pro Ala Asn Pro Leu Gly He Glu He 
125 130 135 



40 



GAG AAG TGT AAA CTT CCT CAG AAG AAC TTA GAA GTG TTA TTC CAT GGC 664 
Glu Lys Cys Lys Leu Pro Gin Lys Asn Leu Glu Val Leu Phe His Gly 
140 145 150 155 



45 



ATG AGT GCT GAT TTT ACG AGT GAG AAT TTT TCA GCA GCC TGG TAT CTC 



712 
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Met Ser Ala Asp Phe Thr Ser Glu Asn Phe Ser Ala Ala Trp Tyr Leu 
160 165 170 

ATA GAG AAC CAC TCC AAC ACC AGT TTT GAG CAG CTC AAG ATG GCA GTT 760 
5 lie Glu Asn His Ser Asn Thr Ser Phe Glu Gin Leu Lys Met Ala Val 
175 180 185 

ACC AAC CTC AAA AGG CAG GCC AAC AAG AAG AGC GAG GGT AGT CTA GCG 808 
Thr Asn Leu Lys Arg Gin Ala Asn Lys Lys Ser Glu Gly Ser Leu Ala 
190 195 200 

TAC GTG AAG GGT GGC CTC AGT ACC TTC TTT GAA GCT CAA GAT GCT CTC 856 
Tyr Val Lys Gly Gly Leu Ser Thr Phe Phe Glu Ala Gin Asp Ala Leu 
205 210 215 

TCA GCT ATC CAT CAA AAA CTA GAA GCA GAC GGA ACA GAA AAA GTA GAA 904 
Ser Ala lie His Gin Lys Leu Glu Ala Asp Gly Thr Glu Lys Val Glu 
220 225 230 235 

GGA TCC ATG ACA CAG AAA TTG GAG AAT GTC CTC AAC AGA GCA AGT AAT 952 
Gly Ser Met Thr Gin Lys Leu Glu Asn Val Leu Asn Arg Ala Ser Asn 
240 245 250 

ACT GCT GAC ACA TTG TTT CAA GAA GTG TTA GGT CGG AAA GAC AAG GCA 1000 
Thr Ala Asp Thr Leu Phe Gin Glu Val Leu Gly Arg Lys Asp Lys Ala 
255 260 265 

GAT TCC ACT AGG AAT GCA CTC AAC GTG CTT CAG CGG TTT AAA TTT CTC 1048 
Asp Ser Thr Arg Asn Ala Leu Asn Val Leu Gin Arg Phe Lys Phe Leu 
270 275 280 

TTC AAC CTT CCT CTC AAT ATC AAA CGG AAC ATT CAA AAG GGG GAT TAT 1096 
Phe Asn Leu Pro Leu Asn He Lys Arg Asn He Gin Lys Gly Asp Tyr 
285 290 295 

GAT GTG GTT ATT AAT GAT TAT GAA AAA GCC AAA TCG CTC TTT GGG AAA 1144 
Asp Val Val He Asn Asp Tyr Glu Lys Ala Lys Ser Leu Phe Gly Lys 
300 305 310 315 

ACG GAG GTG CAA GTT TTC AAG AAA TAT TAT GCT GAA GTA GAG GCA GGA 1192 
Thr Glu Val Gin Val Phe Lys Lys Tyr Tyr Ala Glu Val Glu Ala Gly 
320 325 330 

ATT GAA GAT CTA AGA GAG TTA CTT CTA AAG AAA CTG CTC GAG ACT CCG 1240 
He Glu Asp Leu Arg Glu Leu Leu Leu Lys Lys Leu Leu Glu Thr Pro 
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335 340 345 

TCA ACC TTA CAT GAC CAG AAG CGT TAC ATA AGG TAT CTT TCT GAC CTC 1288 
Ser Thr Leu His Asp Gin Lys Arg Tyr lie Arg Tyr Leu Ser Asp Leu 
5 350 355 360 

CAT GCT CCT GGT GAT CCT GCT TGG CAG TGT ATC GGA GCT CAG CAC AAG 1336 
His Ala Pro Gly Asp Pro Ala Trp Gin Cys lie Gly Ala Gin His Lys 
365 370 375 

10 

TGG ACC CTC AAA CTC ATG CAA GAC TGC AAA GAA GGC CAC ATG AAG AGC 1384 
Trp Thr Leu Lys Leu Met Gin Asp Cys Lys Glu Gly His Met Lys Ser 
380 385 390 395 

15 CTG AAA GGT AAC CCA GGC CCA CAC AGC CCC ATG CTT GAC CTA GAT AAT 1432 

Leu Lys Gly Asn Pro Gly Pro His Ser Pro Met Leu Asp Leu Asp Asn 
400 405 410 

20 GAT GCC CGC CCC TCT GTG TTG GGC CAT CTC AGT CAG ACA GCA TCA CTG 1480 
Asp Ala Arg Pro Ser Val Leu Gly His Leu Ser Gin Thr Ala Ser Leu 
415 420 425 

AAG AGA GGC AGC AGC TTC CAG TCT GGT CGA GAT GAC ACA TGG AGG TAC 1528 
25 Lys Arg Gly Ser Ser Phe Gin Ser Gly Arg Asp Asp Thr Trp Arg Tyr 
430 435 440 

AAG ACG CCC CAC AGA GTG GCA TTT GTT GAA AAA TTA ACG AAG CTT GTG 1576 
Lys Thr Pro His Arg Val Ala Phe Val Glu Lys Leu Thr Lys Leu Val 
30 445 450 455 

TTA AGT CAG CTG CCT AAC TTC TGG AAA CTC TGG ATT TCA TAT GTT AAT 1624 
Leu Ser Gin Leu Pro Asn Phe Trp Lys Leu Trp lie Ser Tyr Val Asn 
460 465 470 475 

35 

GGA AGT CTG TTC AGT GAG ACA GCT GAG AAG TCA GGC CAG ATT GAA AGA 1672 
Gly Ser Leu Phe Ser Glu Thr Ala Glu Lys Ser Gly Gin He Glu Arg 
480 485 490 

40 TCA AAG AAT GTG AGG CAA AGA CAA AAT GAT TTC AAG AAA ATG ATT CAA 1720 
Ser Lys Asn Val Arg Gin Arg Gin Asn Asp Phe Lys Lys Met He Gin 
495 500 505 



45 



GAA GTA ATG CAC TCC CTG GTG AAG TTG ATC CGG GGA GCC CTG CTC CCA 
Glu Val Met His Ser Leu Val Lys Leu He Arg Gly Ala Leu Leu Pro 



1768 



WO 98/28419 



PCT/US97/23498 



10 



15 



113 

510 515 520 

TTC AGC CTC CGA GAA GGG GAT GGG AGA CAG, TAT GGC GGC TGG GAG GTG 1816 
Phe Ser Leu Arg Glu Gly Asp Gly Arg Gin Tyr Gly Gly Trp Glu Val 
525 530 535 

CAG GCA GAG CTC TCG GGG CAG TGG CTC GCA CAC GTT ATC CAG ACC ATA 1864 
Gin Ala Glu Leu Ser Gly Gin Trp Leu Ala His Val lie Gin Thr lie 
540 545 550 555 

AGG CTG ACA TAT GAA TCA CTG ACC GCT CTG GAG ATT CCC AAT GAC ATG 1912 

Arg Leu Thr Tyr Glu Ser Leu Thr Ala Leu Glu He Pro Asn Asp Met 
560 565 570 

TTA CAG ATT ATC CAG GAC CTG ATT TTG GAT CTC CGT ATA CAT TGC ATA 1960 
Leu Gin He He Gin Asp Leu He Leu Asp Leu Arg He His Cys lie 
575 580 585 

20 ATG GTG ACG TTG CAA CAT ACA GCT GAA GAG ATA AAG AGA TTA GCT GAA 2008 
Met Val Thr Leu Gin His Thr Ala Glu Glu He Lys Arg Leu Ala Glu 
590 595 600 

AAA GAA GAT TGG ATT GTT GAT AAC GAA GGA CTG ACT TCC CTA CCA TGT 2056 
25 Lys Glu Asp Trp He Val Asp Asn Glu Gly Leu Thr Ser Leu Pro Cys 
605 610 615 

CAG TTT GAA CAG AGC ATT GTT CAC TCC CTG CAG TCG TTG AAG GGT GTT 2104 
Gin Phe Glu Gin Ser He Val His Ser Leu Gin Ser Leu Lys Gly Val 
30 620 625 630 635 

GTG GAT TGC AAA CCT GGA GAA GCC AGT GTC TTT CAG CAG CCT AAA ACC 2152 
Val Asp Cys Lys Pro Gly Glu Ala Ser Val Phe Gin Gin Pro Lys Thr 
640 645 650 

35 

CAG GAG GAG GTT TGC CAG TTG TGC ATC AGT ATC ATG CAG GTT TTT ATA 2200 
Gin Glu Glu Val Cys Gin Leu Cys He Ser He Met Gin Val Phe He 
655 660 665 

40 TAC TGC CTG GAA CAA CTG AGC ACC AAG CCT GAT GCG GAC ATA GAT ACT 2248 
Tyr Cys Leu Glu Gin Leu Ser Thr Lys Pro Asp Ala Asp He Asp Thr 
670 675 680 



45 



ACT CAT CTT TCT GTT GAT GTT TCT TCT CCT GAT TTG TTT GGA AGC ATC 
Thr His Leu Ser Val Asp Val Ser Ser Pro Asp Leu Phe Gly Ser He 
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685 690 695 

CAT GAA GAC TTC GGT TTG ACT TCA GAG CAG AGG CTT TTG ATA GTC CTG 2344 
His Glu Asp Phe Gly Leu Thr Ser Glu Gin Arg Leu Leu He Val Leu 
5 700 705 710 715 

AGT AAT TGC TGC TAT TTA GAA CGT CAC ACC TTC CTA AAC ATA GCA GAA 2392 

Ser Asn Cys Cys Tyr Leu Glu Arg His Thr Phe Leu Asn He Ala Glu 
10 720 725 730 

CAT TTT GAA AAG CAC AAC TTC CAG GGA ATA GAA AAA ATA ACA CAG GTT 2440 
His Phe Glu Lys "His Asn Phe Gin Gly He Glu Lys He Thr Gin Val 
735 740 745 

15 

AGT ATG GCA TCA CTG AAA GAA CTG GAT CAA CGA CTC TTT GAA AAT TAC 2488 
Ser Met Ala Ser Leu Lys Glu Leu Asp Gin Arg Leu Phe Giu Asn Tyr 
750 755 760 

20 ATT GAG CTA AAA GCA GAT CCT ATT GTT GGA TCC TTG GAA ACT GGG ATA 2536 
He Glu Leu Lys Ala Asp Pro He Val Gly Ser Leu Glu Thr Gly He 
765 770 775 

TAT GCA GGC TAT TTT GAT TGG AAA GAC TGT CTG CCT CCA GCA GGT GTC 2584 
25 Tyr Ala Gly Tyr Phe Asp Trp Lys Asp Cys Leu Pro Pro Ala Gly Val 
780 785 790 795 

AGA AAC TAT TTA AAA GAA GCA CTG GTG AAT ATA ATT GCT GTG CAT GCA 2632 
Arg Asn Tyr Leu Lys Glu Ala Leu Val Asn He He Ala Val His Ala 
30 800 805 810 

GAG GTG TTC ACT ATT TCC AAA GAA CTG GTG CCC CGG GTC CTG GCC AGA 2680 
Glu Val Phe Thr He Ser Lys Glu Leu Val Pro Arg Val Leu Ala Arg 
815 820 825 

35 

GTG ATA GAA GCA GTC TCT GAA GAG CTG AGC CGG CTG ATG CAG TGT GTC 2728 
Val He Glu Ala Val Ser Glu Glu Leu Ser Arg Leu Met Gin Cys Val 
830 835 840 

40 TCA TCC TTC AGC AGA AAT GGA GCG CTG CAG GCC AGA CTT GAA ATC TGT 2776 
Ser Ser Phe Ser Arg Asn Gly Ala Leu Gin Ala Arg Leu Glu He Cys 
845 850 855 



45 



GCT CTG AGG GAC ACT GTA GCC ATT TAC CTG ACC CCG GAG AGC AGG TCT 
Ala Leu Arg Asp Thr Val Ala He Tyr Leu Thr Pro Glu Ser Arg Ser 
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860 865 870 875 

AGT TTT AAA CAG GCC CTG GAG GCC CTG CCT CAG CTT GCA AGT GGT GCA 2872 

5 Ser Phe Lys Gin Ala Leu Glu Ala Leu Pro Gin Leu Ala Ser Gly Ala 
880 885 890 

GAC AAA AAG CTA CTG GAA GAG CTG CTC AAC AAG TTC AAG AGC AGT ATG 2920 
Asp Lys Lys Leu Leu Glu Glu Leu Leu Asn Lys Phe Lys Ser Ser Met 
10 895 900 905 

CAC TTG CAG CTC ACC TGC TTC CAA GCT GCG TCT CCA ACA GTG ATG AAG 2968 
His Leu Gin Leu Thr Cys Phe Gin Ala Ala Ser Pro Thr Val Met Lys 
910 915 920 

15 

ACA TAGACACAGA CAGAGATCCC AGTCCAAGCA CTGCATTTAC CTTCTCTCTT TTCCCC 3027 
Thr 

AGCATGGTGG TGCAGAAAGA TGTTGATGGG TATTTAATTT CAGCCGATTT AGACTACCAT 3087 

20 ACAATTTTGT TATTACTAGC TTTGGTCCTG TAGCATACTC TTCAAAATGT TTTTTCTTAG 3147 

CTCTATGTTT CTTGGTGTGA GTTTTGAGAG CTTCTCCAGC ATGGCCCAAA GCGGCCNATG 3207 

GTGTGTGTGA ATGTGCTNCG CAGTGGCGCT- CTACCTGGCC TCAGTGTTGG CAGTTTCCAG 3267 

CCCAGGGGAG GATATGAAGT CACAGCTTTT GGAGCTGTCA TAAACTTTAA ACCCACCATT 3327 

TCAAACAAGT GGTTCTAGGT GCGCTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC 3387 

25 TGTAAAGAAA AATATAAAAA TGTTAAAATC TGTGATGCAG AAGTCCCTGG ATGAACTCTG 3447 

AATTTCATAG TCTGGCTTTC ATTTTTCCCA GTGTTCAACA CGGAAGCTTC TGGTTCTCAC 3507 

CCTTAAGTTG ATGCATCGTT GTGGGGAGGG TTTACAAGGA CCTGAATGTG GAGGCCCTGT 3567 

CACACCCGCA AGCCCTGTGG CAGGCTTGTG GTCTTCAATG CAACCTAAAG ATTAAGCCAG 3627 

GCTCCTCCCC TCCCGCAGGT TACTCCCCTC TGCTACTTTC CACACTTCGA AATGTCTTGC 3687 

30 TCCATAGCTG GAAAGATGGT GCCCTGGCAC AGGCTGTGTT TTGTAAAGAT TAGAGGTGCC 3747 

TCGGTGACCA GGCATTGAGC AGTGACTCAC TGCTCACTGG CTGGCACAGT AGAATCTGGT 3807 

CACTGCGACC TGCAAGCAGC TTTTACCTCA CTGCATCTGA CCCTAGCTTC AGCTCTGGTT 3867 

CACTCCAGAC AGATCTGTGG CGCCTTTCTT TCAGTTGACA AAATTCCTGT CACTTTTAGT 3927 

TCAGGTAATT CCAGAGCACA GCATATGCAC ACTATGGATT ATTTGAGGTT AATAACAGTG 3987 

35 TGTTTATTTA TCACAGTAAC TGGAAGTACA AATTTAATCC TTTGTTTGCA AAGTATGTAT 4047 

GTACAGCATT GTCTCCTCAG TGCACTGGTA TTACATCTGG CATAGCCAAA CTGAATTCCA 4107 

AGGCAGAAGT ACACAACTAT TGGTTGTTTA AACACCTAAC AAATACACTA CACAAAGCCA 4167 

GTAGTCTTCC TTCTCTGTAA AACCTCTAAG TGTCTTTATT TCACCATTTT TTATTAGCTC 4227 

TGTAATTGTT TATATTCCTA GTTTTAAAAT CCAAAATAAA TAGTGGTGTT CTCTCTTTAA 4287 

40 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 4347 

AAAAAAAAAA A 4358 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 924 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 



10 Met Ser Arg Ser Arg Gin Pro Pro Leu Val Thr Gly lie Ser Pro Asn 
15 10 15 

Glu Gly He Pro Trp Thr Lys Val Thr He Arg Gly Glu Asn Leu Gly 

20 25 30 

Thr Gly Pro Thr Asp Leu He Gly Leu Thr He Cys Gly His Asn Cys 
15 35 40 45 

Leu Leu Thr Ala Glu Trp Met Ser Ala Ser Lys He Val Cys Arg Val 

50 55 60 

Gly Gin Ala Lys Asn Asp Lys Gly Asp lie lie Val Thr Thr Lys Ser 
65 70 75 80 

20 Gly Gly Arg Gly Thr Ser Thr Val Ser Phe Lys Leu Leu Lys Pro Glu 

85 90 95 

Lys He Gly He Leu Asp Gin Ser Ala Val Trp Val Asp Glu Met Asn 

100 105 HO 

Tyr Tyr Asp Met Arg Thr Asp Arg Asn Lys Gly He Pro Pro Leu Ser 
25 115 120 125 

Leu Arg Pro Ala Asn Pro Leu Gly He Glu He Glu Lys Cys Lys Leu 

130 135 140 

Pro Gin Lys Asn Leu Glu Val Leu Phe His Gly Met Ser Ala Asp Phe 
145 150 155 160 

30 Thr Ser Glu Asn Phe Ser Ala Ala Trp Tyr Leu He Glu Asn His Ser 

165 170 175 

Asn Thr Ser Phe Glu Gin Leu Lys Met Ala Val Thr Asn Leu Lys Arg 

180 185 190 

Gin Ala Asn Lys Lys Ser Glu Gly Ser Leu Ala Tyr Val Lys Gly Gly 
35 195 200 205 

Leu Ser Thr Phe Phe Glu Ala Gin Asp Ala Leu Ser Ala He His Gin 

210 215 220 

Lys Leu Glu Ala Asp Gly Thr Glu Lys Val Glu Gly Ser Met Thr Gin 
225 230 235 240 

40 Lys Leu Glu Asn Val Leu Asn Arg Ala Ser Asn Thr Ala Asp Thr Leu 

245 250 255 

Phe Gin Glu Val Leu Gly Arg Lys Asp Lys Ala Asp Ser Thr Arg Asn 

260 265 270 

Ala Leu Asn Val Leu Gin Arg Phe Lys Phe Leu Phe Asn Leu Pro Leu 
45 275 280 285 
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Asn He Lys Arg Asn He Gin Lys Gly Asp Tyr Asp Val Val He Asn 

290 295 300 

Asp Tyr Glu Lys Ala Lys Ser Leu Phe Gly Lys Thr Glu Val Gin Val 
305 310 315 320 

5 Phe Lys Lys Tyr Tyr Ala Glu Val Glu Ala Gly He Glu Asp Leu Arg 

325 330 335 

Glu Leu Leu Leu Lys Lys Leu Leu Glu Thr Pro Ser Thr Leu His Asp 

340 345 350 

Gin Lys Arg Tyr He Arg Tyr Leu Ser Asp Leu His Ala Pro Gly Asp 
10 355 360 365 

Pro Ala Trp Gin Cys He Gly Ala Gin His Lys Trp Thr Leu Lys Leu 

370 375 380 

Met Gin Asp Cys Lys Glu Gly His Met Lys Ser Leu Lys Gly Asn Pro 
385 390 395 400 

15 Gly Pro His Ser Pro Met Leu Asp Leu Asp Asn Asp Ala Arg Pro Ser 

405 410 415 

Val Leu Gly His Leu Ser Gin Thr Ala Ser Leu Lys Arg Gly Ser Ser 

420 425 430 

Phe Gin Ser Gly Arg Asp Asp Thr Trp Arg Tyr Lys Thr Pro His Arg 
20 435 440 445 

Val Ala Phe Val Glu Lys Leu Thr Lys Leu Val Leu Ser Gin Leu Pro 

450 455 460 

Asn Phe Trp Lys Leu Trp He Ser Tyr Val Asn Gly Ser Leu Phe Ser 
465 470 475 480 

25 Glu Thr Ala Glu Lys Ser Gly Gin He Glu Arg Ser Lys Asn Val Arg 

485 490 495 

Gin Arg Gin Asn Asp Phe Lys Lys Met He Gin Glu Val Met His Ser 
500 505 510 



30 Leu Val Lys Leu He Arg Gly Ala Leu Leu Pro Phe Ser Leu Arg Glu 
515 520 525 

Gly Asp Gly Arg Gin Tyr Gly Gly Trp Glu Val Gin Ala Glu Leu Ser 

530 535 540 

Gly Gin Trp Leu Ala His Val He Gin Thr He Arg Leu Thr Tyr Glu 
35 545 550 555 560 

Ser Leu Thr Ala Leu Glu He Pro Asn Asp Met Leu Gin He He Gin 

565 570 575 

Asp Leu He Leu Asp Leu Arg He His Cys He Met Val Thr Leu Gin 
580 585 590 

40 His Thr Ala Glu Glu He Lys Arg Leu Ala Glu Lys Glu Asp Trp He 
595 600 605 

Val Asp Asn Glu Gly Leu Thr Ser Leu Pro Cys Gin Phe Glu Gin Ser 

610 615 620 

He Val His Ser Leu Gin Ser Leu Lys Gly Val Val Asp Cys Lys Pro 
45 625 630 635 640 
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Gly Glu Ala Ser Val Phe Gin Gin Pro Lys Thr Gin Glu Glu Val Cys 

645 650 655 

Gin Leu Cys He Ser He Met Gin Val Phe He Tyr Cys Leu Glu Gin 
660 665 670 

5 Leu Ser Thr Lys Pro Asp Ala Asp He Asp Thr Thr His Leu Ser Val 
675 680 685 

Asp Val Ser Ser Pro Asp Leu Phe Gly Ser He His Glu Asp Phe Gly 

690 695 700 

Leu Thr Ser Glu Gin Arg Leu Leu He Val Leu Ser Asn Cys Cys Tyr 
10 705 710 715 720 

Leu Glu Arg His Thr Phe Leu Asn He Ala Glu His Phe Glu Lys His 

725 730 735 

Asn Phe Gin Gly He Glu Lys He Thr Gin Val Ser Met Ala Ser Leu 
740 745 750 

15 Lys Glu Leu Asp Gin Arg Leu Phe Glu Asn Tyr He Glu Leu Lys Ala 
755 760 765 

Asp Pro He Val Gly Ser Leu Glu Thr Gly He Tyr Ala Gly Tyr Phe 

770 775 780 

Asp Trp Lys Asp Cys Leu Pro Pro Ala Gly Val Arg Asn Tyr Leu Lys 
20 785 790 795 800 

Glu Ala Leu Val Asn He He Ala Val His Ala Glu Val Phe Thr He 

805 810 815 

Ser Lys Glu Leu Val Pro Arg Val Leu Ala Arg Val He Glu Ala Val 
820 825 830 

25 

Ser Glu Glu Leu Ser Arg Leu Met Gin Cys Val Ser Ser Phe Ser Arg 

835 840 845 

Asn Gly Ala Leu Gin Ala Arg Leu Glu He Cys Ala Leu Arg Asp Thr 
850 855 860 

30 Val Ala He Tyr Leu Thr Pro Glu Ser Arg Ser Ser Phe Lys Gin Ala 
865 870 875 880 

Leu Glu Ala Leu Pro Gin Leu Ala Ser Gly Ala Asp Lys Lys Leu Leu 

885 890 895 

Glu Glu Leu Leu Asn Lys Phe Lys Ser Ser Met His Leu Gin Leu Thr 
35 900 905 j 910 

Cys Phe Gin Ala Ala Ser Pro Thr Val Met Lys Thr 
915 920 
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IT IS CLAIMED: 

1. A substantially purified polypeptide selected from the group consisting 

of 

5 a. SA-17S p71, having a molecular weight of about 71 kD and having the 

sequence of SEQ ID NO:54; 

b. SA-17S p79, having a molecular weight of about 79 kD and having the 
sequence of SEQ ID NO:56; 

c. SA-17S p84, having a molecular weight of about 84 kD and having the 
10 sequence of SEQ ID NO:58; 

d. SA-17S p96, having a molecular weight of about 96 kD and having the 
sequence of SEQ ID NO:60; 

e. SA-17S pl02, having a molecular weight of about 102 kD and having 
the sequence of SEQ ID NO:62; 

15 f. SA-17S pl06, having a molecular weight of about 106 kD and 

characterized by sequences selected from the group consisting of SEQ ID NO:45, 
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID 
NO:50, SEQ ID NO:51, and SEQ ID NO:52; and 

g. a polypeptide having (i) at least 60% sequence identity with a selected 
20 one of the SA-17S polypeptides (a)-(f), and (ii) the property of the selected 

polypeptide to form an SC complex-binding SA-17S complex when combined 
stoichiometrically with other polypeptide components of the SA-17S complex. 

2. The polypeptide of claim 1, which is selected from the group consisting 
25 of (aHe)and(h), 

h. a polypeptide having (i) at least 60% sequence identity with a selected 
one of the SA-17S polypeptides (aHe), and (ii) the property of the selected peptide 
to form an SC complex-binding SA-17S complex when combined stoichiometrically 
with other polypeptide components of the SA-17S complex. 

30 

3. The polypeptide of claim 2, which is polypeptide (a) or a polypeptide 
having (i) at least 60% sequence identity with polypeptide (a) and the property of 
the polypeptide (a) to form an SC complex-binding SA-17S complex when 
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combined stoichiometrically with other polypeptide components of the SA-17S 
complex. 

4. The polypeptide of claim 2, which is polypeptide (b) or a polypeptide 
having (i) at least 60% sequence identity with polypeptide (b) and the property of 
the polypeptide (b) to form an SC complex-binding SA-17S complex when 
combined stoichiometrically with other polypeptide components of the SA-17S 
complex. 

5. The polypeptide of claim 2, which is polypeptide (c) or a polypeptide 
having (i) at least 60% sequence identity with polypeptide (c) and the property of 
the polypeptide (c) to form an SC complex-binding SA-17S complex when 
combined stoichiometrically with other polypeptide components of the SA-17S 
complex. 

6. The polypeptide of claim 2, which is polypeptide (d) or a polypeptide 
having (i) at least 60% sequence identity with polypeptide (d) and the property of 
the polypeptide (d) to form an SC complex-binding SA-17S complex when 
combined stoichiometrically with other polypeptide components of the SA-17S 
complex. 

7. The polypeptide of claim 2, which is polypeptide (e) or a polypeptide 
having (i) at least 60% sequence identity with polypeptide (e) and the property of 
the polypeptide (e) to form an SC complex-binding SA-17S complex when 
combined stoichiometrically with other polypeptide components of the SA-17S 
complex. 

i 

8. A substantially purified polynucleotide encoding one of the polypeptides 
(a)-(g) in claim 1, said polynucleotide being selected from the group consisting of 

a. a polynucleotide having the sequence SEQ ID NO:53, which encodes the 
SA-17S p71 protein; 

b. a polynucleotide having the sequence SEQ ID NO:55, which encodes the 
SA-17S p79 protein; 
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c. a polynucleotide having the sequence SEQ ID NO:57, which encodes the 
SA-17S p84 protein; 

d. a polynucleotide sequence having the sequence SEQ ID NO:59, which 
encodes the SA-17S p96 protein; 

5 e. a polynucleotide having the sequence SEQ ID NO:61, which encodes the 

SA-17S pl02 protein; and 

f. a polynucleotide (i) having at least 60% sequence identity with a selected 

one of the SA-17S polynucleotides (a)-(e), and (ii) encoding a polypeptide having 

the property of the polypeptide encoded by the selected polynucleotide to form an 
10 SC complex-binding SA-17S complex when combined stoichiometrically with other 

polypeptide components of the SA-17S complex. 



9. The polynucleotide of claim 8, which is polynucleotide (a) or a 
polynucleotide having (i) at least 60% sequence identity with polynucleotide (a) and 
15 the property of the SA-17S p71 polypeptide encoded by polynucleotide (a) to form 
an SC complex-binding SA-17S complex when combined stoichiometrically with 
other polypeptide components of the SA-17S complex. 



10. The polynucleotide of claim 8, which is polynucleotide (b) or a 
20 polynucleotide having (i) at least 60% sequence identity with polynucleotide (b) and 
the property of the SA-17S p79 polypeptide encoded by polynucleotide (b) to form 
an SC complex-binding SA-17S complex when combined stoichiometrically with 
other polypeptide components of the SA-17S complex. 

25 11. The polynucleotide of claim 8, which is polynucleotide (c) or a 

polynucleotide having (i) at least 60% sequence identity with polynucleotide (c) and 
the property of the SA-17S p84 polypeptide encoded by polynucleotide (c) to form 
an SC complex-binding SA-17S complex when combined stoichiometrically with 
other polypeptide components of the SA-17S complex. 

30 

12. The polynucleotide of claim 8, which is polynucleotide (d) or a 
polynucleotide having (i) at least 60% sequence identity with polynucleotide (d) and 
the property of the SA-17S p96 polypeptide encoded by polynucleotide (d) to form 
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an SC complex-binding SA-17S complex when combined stoichiometrically with 
other polypeptide components of the SA-17S complex. 

13. The polynucleotide of claim 8, which is polynucleotide (e) or a 
5 polynucleotide having (i) at least 60% sequence identity with polynucleotide (e) and 
the property of the SA-17S pl02 polypeptide encoded by polynucleotide (e) to form 
an SC complex-binding SA-17S complex when combined stoichiometrically with 
other polypeptide components of the SA-17S complex. 

10 14. A method of identifying a compound capable of modulating secretion 

of secretory vesicles, comprising 

contacting a secretion associated 17S (SA-17S) complex containing a 
polypeptide from claim 1 with a syntaxin-containing (SC) complex in the presence 
and absence of a test compound, 
15 measuring the effect of the test compound on the extent of binding between 

the SA-17S and the SC complexes, and 

identifying said compound as effective if its measured effect on the extent 
of binding is above a threshold level. 

20 15. The method of claim 14, wherein said SA-17S complex binds to the 

7S SC complex or the 20S SC complex. 

16. The method of claim 14, wherein said secretory vesicles are 
neurotransmitter-containing synaptic vesicles. 

25 

17. The method of claim 16, wherein the test compound is effective to 
enhance binding of the SC complex with said SA-17S complex. 

18. The method of claim 16, wherein the test compound is effective to 
30 inhibit binding of the SC complex with said SA-17S complex. 
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19. A method for treating an affective disorder comprising administering 
to a subject in need of such treatment an effective amount of a compound identified 
in claim 17. 

5 20. A method for treating a neurodegenerative disease comprising 

administering to a subject in need of such treatment an effective amount of a 
compound identified in claim 17. 

21. A method for treating schizophrenia comprising administering to a 
10 subject in need of such treatment an effective amount of a compound identified in 
claim 18. 
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Fig. 3 A 
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GGLSTFFEAQDAISAI HQK 
2021.2 Da 

(SEQ ID NO:39) 
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rsec8 1. XAPEGPLIDVXNl T?nfT AA 

2. EFAAFFAK J? ly • 

3. XLGVQRPLLQSTSIXE 



p106 1. XXYGEIAXK 

2. XATVSLPEK 

3. XDYGVIAND 

4. LIKYFFMVASVK 

5. ELPEFNLHFFK 

6. XLODVDLASXR 

7. XNRXNEPAVNVL 

8. XQLXNIVEPEXIY 



Fig. 4B 



p102 1. YLSGLQAPGXPASQSIGAQ 

2. GGLSTFFEAQDALS Al HQK 

3. ASNTADTXRQER 

4. XRENYIEGGI 

5. ENLGRLFENYI 

6. XDYDVVINDYE 

7. XIPXLSTRPANP 



Fig. 4C 



p96 1. VVGQFPFQDTELEK 

2. XVYEIFDN 

3. DFLESIR 

4. XDQDLQLADYDHMT 

5. STNLLLTRTLXN 



Fig* 4D 
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rsec6 1. DFRQSINTIEXL 
. - _ 2. QGPSQASPNYXP 

Fig. 4E 3. AAIQSQLDGVRTGLSQ 

p84 1. QLSQQSDGDRDLQEWRQRVQALAEETAQYK 

2. XLQLSFNFSEPNRQRP 

3. SIPLALLPAAAAGA 

4. DAVXQNSTQAAETE N 

5. DYRNDEA 
FXCT. 4F 6. ENNPEEDDPS 

° 7. XLSQQSDXG 

8. AAALRAPPXVTS 

9. XKREPLE 

p79 1 . XTDYIAE 
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Fig. 6A 
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Fig. 7 A 
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Fig. 8A 
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